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1.0  EXECUTIVE  SUMMARY 

This  report  documents  the  results  from  the  research  performed  under  the  contract  F49629-01- 
C-0027  entitled  “Robust  Adaptive  Control  of  UCAVs”.  Our  research  was  aimed  at  developing 
robust  nonlinear  adaptive  control  algorithms  for  UCAV  flight  control,  flexible  mode 
compensation  and  stabilization,  and  electromagnetic  actuators.  Specific  techniques  investigated 
were  Intemal-Model-Principle-Based  Nonlinear  Output  Regulation  and  the  Adaptive  Inverse 
Method.  For  comparison  purposes.  Neural  Networks  was  also  investigated. 

1.1  Robust  Adaptive  Nonlinear  Output  Regulation 

In  order  to  develop  nonlinear  adaptive  control  for  UCAVs,  Internal  Model  Principle 
Nonlinear  Output  Regulation  (IMPNOR)  technique  was  investigated.  This  controller  design 
technique  was  applied  to  structural  mode  filter  design,  and  UCAV  adaptive  flight  control,  and 
the  performance  was  evaluated  for  implementation. 

For  the  structural  niode  filter  design,  the  classical  notch  filter  is  often  used  in  practice.  The 
draw  back  to  this  approach  is  that  the  frequency  to  be  suppressed  needs  to  be  known  exactly, 
otherwise  the  filter  would  miss  the  target  frequency  and  fail  to  suppress  it.  In  addition,  once  the 
filter  has  been  designed,  the  target  frequency  would  be  fixed  and  cannot  accommodate  any  other 
frequencies.  As  will  be  discussed  in  the  next  section,  the  frequency  that  needs  to  be  attenuated 
could  change  depending  on  maneuvers  the  aircraft  performs,  particularly  for  flexible  aircraft 
such  as  UCAVs.  Hence,  the  goal  was  to  develop  an  adaptive  method  to  dynamically  filter  out 
the  undesirable  effects  of  the  structural  modes  on  the  flight  control  sensors.  As  will  be  discussed 
in  Section  3,  the  IMPNOR  method  was  employed  to  design  a  control  law  that  tracks  the 
reference  command  for  take-off  and  landing  while  suppressing  the  undesirable  harmonic  signals 
whose  frequencies  were  not  known  a  priori.  The  simulation  results  show  that  simultaneous 
command  tracking  and  disturbance  rejection  were  achieved  successfully  by  the  IMPNOR 
method. 

For  the  UCAV  adaptive  flight  control,  we  applied  the  IMPNOR  method  to  the  X-45A  linear 
model  and  designed  flight  control  laws  which  accommodate  surface  failures  and  nonlinear 
modeling  errors.  Both  inner-loop  and  outer-loop  responses  were  evaluated  with  and  without 
surface  failures.  The  IMPNOR  method  tracked  the  reference  command  successfully  in  spite  of 
the  surface  failures.  To  assess  the  performance  of  the  resulting  IMPNOR  control  laws,  an 
alternative  method  using  Neural  Networks  (NN)  was  applied  to  the  same  linear  model  and  the 
results  were  compared.  Both  IMPNOR  and  NN  controllers  seemed  to  track  the  command  well. 
The  details  are  discussed  in  Section  4. 

1.2  Adaptive  Control  of  Electromagnetic  Actuators 

The  Adaptive  Inverse  Control  method  developed  by  G.  Tao  and  P.  Kokotovic  was 
investigated  to  design  a  controller  for  an  electromagnetic  actuator  (EMA).  Specifically,  we 
applied  the  method  to  compensate  the  nonlinear  dynamics  that  appear  in  the  Miniaturized 
Munition  Technology  (MMT)  actuator  subsystems  known  as  dead-zone.  Coulomb  friction,  and 
quantization  errors.  The  simulation  results  showed  the  Adaptation  Inverse  Control  method 
performed  better  than  a  detuned  fixed  inverse  controller  or  a  controller  with  no  inverse  model 
associated  with  it.  However,  the  Adaptive  Inverse  Control  method  also  introduced  chattering, 
slow  adaptation  process,  and  less  robustness  to  larger  magnitude  of  initial  parameter  estimation 
errors.  To  alleviate  these  problems  we  considered  the  IMPNOR  technique  as  an  alternative 
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method,  and  applied  it  to  MMT  actuator  subsystems.  The  simulation  results  showed  that 
IMPNOR  resolved  the  chattering  and  robustness  issues  while  tracking  the  command  in  the 
presence  of  harmonic  disturbances.  The  details  are  discussed  in  Section  5. 

1.3  Aircraft  Departure  Analysis/Prediction  Tool 

We  investigated  a  robustness  analysis  method  appeared  in  [10]  in  an  attempt  to  develop  an 
Aircraft  Departure  Analysis/Prediction  Tool.  We  also  visited  Dr.  R.  Murray  and  Dr.  J.  Doyle  at 
California  Institute  of  Technology,  who  are  two  of  the  co-authors  of  [10]  to  discuss  the  method. 
After  the  discussion,  and  further  investigation,  we  decided  that  the  method  is  not  mature  enough 
at  the  present  time  for  technology  transition,  and  that  we  will  not  proceed  any  further  on  this 
task. 

1.4  Research  Collaboration  with  Universities 

Throughout  the  entire  contract  period  of  performance  for  this  research  program,  Boeing  and 
Washington  University  (Dr.  C.  Byrnes  and  Dr.  A.  Ishidori)  had  a  close  working  relationship  and 
shared  the  interest  in  applying  the  Internal  Model  Principle  Nonlinear  Output  Regulation  Control 
technique  to  real  aircraft  system  in  an  effort  to  transition  the  technology  emanating  from  this 
research  program.  The  research  collaboration  between  Boeing  and  Washington  University 
culminated  in  publication  of  [1 1]. 

We  also  had  a  dialogue  for  potential  research  collaboration  with  Dr.  Murray  at  California 
Institute  of  Technology  (CalTech)  in  the  area  of  robustness  analysis  techniques.  Boeing  and 
CalTech  had  a  technical  coordination  meeting  where  both  parties  expressed  interest  in 
collaboration,  and  decided  that  we  keep  searching  for  research  common  ground  by  keeping  an 
open  dialogue. 

Boeing  also  had  research  collaboration  with  Dr.  Hovakimyan  at  Virginia  Polytechnic 
Institute  and  State  University  in  the  area  of  Air  Collision  Avoidance  problem  from  the  Control 
Theory  point  of  view.  This  research  collaboration  was  culminated  in  publication  of  [12]. 
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2.0  INTRODUCTION 

Affordability  and  safety  have  become  the  prime  focus  for  today's  defense  research.  New 
Unmanned  Combat  Air  Vehicles  (UCAVs)  are  being  developed  to  significantly  reduce  the  cost 
compared  to  existing  manned  systems.  Smart  GPS  guided  weapons  are  being  developed  to 
improve  mission  effectiveness  and  reduce  costs  per  target  kill.  To  answer  to  these  challenging 
demands  and/or  needs,  our  research  was  centered  around  UCAVs  and  advanced  munitions  such 
as  Joint  Direct  Attack  Munitions  (JDAM)  and  the  Small  Smart  Bomb  (250  lb  weapon  under 
development),  and  the  efforts  were  focused  on  the  development  of  new  control  algorithms  that 
would  improve  flight  safety,  mission  reliability,  and  cost  reduction  for  UCAVs  and  these 
advanced  weapon  systems. 

The  specific  areas  of  primary  investigation  under  this  AFOSR  program  were: 

•  Robust  adaptive  control  algorithms  for  flight  control 

-  Robust  adaptive  control  laws 

-  Adaptive  aeroservoelastic  compensation  for  flexible  aircraft 

•  Adaptive  control  compensation  for  electric  actuators 

•  Aircraft  departure  analysis  and  prediction  tools 

These  investigations  were  performed  using  new  theory  on  robust  output  regulation  for 
nonlinear  systems  being  developed  at  Washington  University  in  St.  Louis  (Byrnes  and  Isidori), 
robust  adaptive  control  methods  developed  at  UC  Santa  Barbara  (Kokotovic),  and  robustness 
analysis  of  nonlinear  systems  developed  at  California  Institute  of  Technology  (Murray). 

Robust  adaptive  output  regulation  [l]-[5]  offers  a  potential  to  enhance  flight  performance  and 
improve  robustness  to  uncertain  aerodynamics  and  structural  dynamics.  Our  research 
investigated  this  new  approach  [5],  and  compared  it  with  the  adaptive  neural  networks  approach 
using  an  X-45A  linearized  model.  One  of  our  goals  was  to  demonstrate  this  technology  in  a 
flight  control  application,  and  to  extend  it  to  design  an  adaptive  aeroservoelastic  compensator  for 
flexible  aircraft,  which  would  fulfill  a  critical  need  currently  not  present  in  industry. 

Our  research  on  adaptive  control  compensation  for  electric  actuators  is  directly  applicable  to 
both  UCAV  and  JDAM.  UCAV  is  using  electric  actuators  that  are  a  derivative  of  the  JDAM 
design.  Our  research  in  this  area  would  significantly  reduce  the  cost  of  these  actuators.  Each 
UCAV  uses  18  actuators,  and  each  JDAM  uses  3  actuators.  Our  success  would  lead  to  reduced 
parts  count,  reduced  weight,  and  reduced  cost  for  each  actuator.  Multiplying  by  the  production 
numbers  for  JDAM,  and  the  potential  savings  from  UCAV  and  the  other  munitions  programs 
that  are  under  development,  the  potential  savings  are  enormous. 

In  addition  to  being  affordable,  UCAV  must  be  reliable  and  safe,  and  must  have  the  ability  to 
integrate  with  manned  aircraft  operations.  The  missions  to  be  performed  by  UCAV  are 
highlighted  in  Figure  2- 1 . 

Robust  adaptive  control  and  self-diagnostics,  prognostics,  and  health  monitoring 
technologies  are  needed  to  give  UCAV  the  same  reliability  and  safety  of  manned  aircraft. 
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Reactive  Suppression  Mission 
(Relocatable  Target  /  Adv.  ARM  Weapon) 


Preemptive  Destruction  Mission 
(Relocatable  Target  /  ASMM-F/R  Weapon) 


Figure  2-1.  UCAV  Suppression  of  Air  Enemy  Defense  Mission  Scenarios 


Autonomous  ground  operations,  robust  flight  control,  and  multi-vehicle  coordinated 
operations  are  being  designed  for  UCAV  to  minimize  operator  workload,  increase  mission 
reliability,  and  reduce  support  costs.  These  new  levels  of  vehicle  autonomy  introduce  new 
requirements  on  the  aircraft’s  control  system.  Sophisticated  contingency  management 
algorithms  must  replace  the  pilot's  ability  to  adapt  to  unforeseen  and  unpredictable  events  to 
guarantee  flight  safety  and  mission  reliability.  The  aircraft’s  control  system  must  be  able  to 
respond  to  these  contingencies.  Figure  2-2  illustrates  the  guidance  and  control  modes  for 
UCAV,  as  well  as  several  of  the  contingency  modes.  The  figure  illustrates  autonomous  ground 
operations  (auto  taxi  and  takeoff),  ingress  to  target  area,  mission  execution,  and  safe  return  to 
base. 
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GP44275002.ppt 


Figure  2-2.  UCAV  Guidance,  Control,  and  Contingency  Modes 
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Robust  adaptive  control  plays  a  critical  role  in  the  autonomous  control  ofUCAVs.  These 
algorithms  must  accommodate  unknown  or  uncertain  aerodynamics,  flight  control  actuator 
failures  and/or  battle  damage.  Our  research  addresses  the  need  for  flight  control  and  for  adaptive 
aeroservoelastic  compensation.  The  need  for  reconfigurable  flight  control  is  well  understood, 
and  alternate  solutions  to  this  problem  exist.  The  need  for  adaptive  aeroservoelastic 
compensation  for  flexible  aircraft  remains  unanswered. 

The  X-36  Agility  Research  Aircraft  shown  in  Figure  2-3  is  a  28%  scale  test  aircraft  used  to 
demonstrate  agility  and  handling  qualities  in  a  tailless  fighter  (unstable  in  multiple  axes).  The 
X-36  is  all  composite  aircraft  that  proved  to  have  structural  modes  that  depend  upon  flight 
condition.  Due  to  both  the  aerodynamic  loading  on  the  fuselage  and  wing  and  the  control 
surface  deflection,  the  structural  modes  varied  in  frequency  and  in  damping.  Designing 
compensation  to  attenuate  the  effects  of  the  structural  modes  on  the  flight  control  sensors  (gyros 
and  accelerometers)  proved  to  be  costly  in  time  and  manpower,  and  introduced  risk  into  the 
program.  The  resulting  compensation  used  filters  that  were  scheduled  with  flight  condition, 
including  control  surface  deflection. 


GP44275003.ppt 


Figure  2-3.  X-36  Agility  Research  Aircraft 

Key  in  making  the  Boeing  UCAV  design  affordable  has  been  the  incorporation  of  all-electric 
aircraft  technologies.  Existing  aircraft  use  hydraulic  actuation  systems  that  are  expensive,  heavy, 
and  require  significant  maintenance.  The  Boeing  UCAV  design  currently  has  1 8  electro¬ 
magnetic  actuators  (EMAs).  Six  electric  actuators  are  used  to  control  UCAV's  flight  control 
surfaces,  as  shown  in  Figure  2-4,  with  2  actuators  used  to  control  the  thrust  vectoring  system, 
and  additional  10  actuators  to  control  the  nose  and  main  landing  gear,  gear  doors,  steering 
system,  and  weapon  bay  doors. 

UCAV  has  baselined  identical  actuators  for  all  18  EMAs.  This  in  itself  (using  the  same 
actuator  everywhere  on  the  aircraft)  greatly  reduces  aircraft  costs  and  logistics  footprint.  These 
UCAV  electric  actuators  are  manufactured  by  HR  Textron  under  subcontract  to  Boeing.  The  HR 
Textron  UCAV  design  is  a  derivative  of  the  Joint  Direct  Attack  Munition  (JDAM)  actuator 
design,  and  uses  common  parts  and  manufacturing  processes  and  assembly.  The  large 
production  rate  from  the  JDAM  program  makes  the  HR  Textron  UCAV  actuator  very  attractive 
from  an  affordability  standpoint. 
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Deflection  Ranges 
•  Elevens  +/-  40  deg 
•Yaw  Vectoring  +/-  20  deg 


Pitch  control :  Symmetric  deflection  of  elevons 


Roll  control:  Differential  deflection  of  elevons 


Yaw  control:  Nozzle  vectoring  +  ‘Crow-Mixing’ 
of  outboard/mid  elevons 


GP44275004.ppt 

Figure  2-4.  UCAV  Flight  Control  Surfaces  and  Actuators 

The  UCAV  actuator  design,  shown  in  Figure  2-5,  is  an  electromechanical  design,  in  which 
two  brushless  DC  motors  on  a  common  shaft  drive  a  ball  screw  to  convert  rotary  motion  into 
linear  motion.  When  assembling  the  ball  screw  in  the  actuator  unit,  small  ball  bearings  are 
placed  in  between  the  ball  screw  and  the  ball  nut.  After  the  actuator  is  assembled  the  backlash  is 
measured.  If  the  system  is  too  sloppy,  the  unit  is  disassembled  and  slightly  larger  balls  are 
inserted  to  remove  the  slop.  This  is  labor  intensive  and  expensive.  We  would  like  to  assemble 
the  unit  once  and  for  all,  and  accept  whatever  the  tolerances  are  there.  Allowing  larger 
tolerances  in  the  machining  of  the  ball  screw  could  further  reduce  actuator  costs.  Relaxing  these 
tolerances  will  lead  to  a  system  that  has  more  backlash,  however. 

Thermal  expansion  effects  also  complicate  the  design  of  the  actuator.  If  lower  cost  materials 
could  be  used  (that  have  greater  thermal  expansion),  then  costs  could  be  further  reduced.  These 
materials  are  also  easier  to  machine,  reducing  tooling  costs. 

The  two-DC-motor  design  shown  in  Figure  2-5  exhibits  the  following  characteristics: 

•  Stall  load:  2,500  lb 

•  Max  load  prior  to  back  drive:  3,500  lb 

•  Max  brake  hold  capability:  5,000  lb  (unpowered) 

•  4”  and  6”  total  stroke  configurations 

•  6  in/sec  no  load  rate,  4.5  in/sec  at  1,500  lb 

•  250  flight  hour  actuator  life 
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GP44275005.ppt 

Figure  2-5.  UCAV  Electric  Actuator 


•  28  VDC  and  135  VDC 

•  Peak  current:  12  amps  per  motor  (24  amps  per  actuator) 

•  Estimated  weight:  10  lb 

•  One  Brake,  dual  coil  (Reliability  if  one  fails) 

•  Two  DC  motors  torque  summed  on  common  shaft 

•  Gear  reduction  to  ball  screw  providing  linear  output 

This  type  of  EM  A  actuator  exhibits  different  types  of  nonlinearity  such  as  dead-zone, 
backlash,  and  Coulomb A^iscous  friction.  Our  research  focused  on  adaptive  compensation  to 
remove/negate  these  nonlinear  effects,  and  produced  an  actuator  controller  that  has  repeatable 
performance  from  unit  to  unit. 

Finally,  this  research  program  under  AFOSR  contract  addresses  fundamental  scientific  issues 
with  a  goal  of  rapid  transition  to  real  applications.  The  combination  of  “technology  push”  by 
basic  research  and  “technology  pull”  by  the  applied  programs  provides  specific  technology 
transition  objectives  and  paths.  This  work  is  directly  linked  to  programs  at  The  Boeing 
Company  and  US  Air  Force. 
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3.0  ROBUST  ADAPTIVE  NONLINEAR  OUTPUT  REGULATION 
-  An  Application  To  Structural  Mode  Filter  - 

In  this  section,  we  discuss  the  Internal  Model  Principle  Nonlinear  Output  Regulation 
(IMPNOR)  method  that  suppresses  the  effect  of  harmonic  disturbances  on  the  regulated  output 
of  a  system  to  be  controlled.  Controllers  which  are  optimal  in  an  LQ  senses  are  derived  for  the 
suppression  of  harmonic  disturbances  for  discrete-time,  finite-dimensional,  multivariable  linear 
systems.  These  controllers  are  shown  to  enjoy  some  of  the  robusmess  features  concomitant  with 
an  LQ  optimization  scheme. 

This  work  can  be  applied  in  a  variety  of  settings,  for  example  in  active  noise  control  which 
was  studied  in  [9]  for  scalar-input  scalar-output  systems  using  a  minimum  variance  performance 
measure.  The  suppression  of  harmonic  disturbances  also  arises  in  the  active  control  of  vibrations 
in  helicopters [6],  where  a  continuous-time  model  is  used  in  the  context  of  LQ  control  and 
Kalman  filtering. 

Our  interest  in  suppression  of  harmonic  disturbances  arose  in  the  development  of  feedback 
control  strategies  for  next  generation  aircraft  in  particular,  for  certain  aircraft  it  is  known  that  the 
wings  exhibit  slightly  damped  flexible  behavior  which,  when  excited  by  a  wind  gust,  produce  a 
disturbance  which  additively  corrupts  some  important  velocity  measurements.  The  frequencies 
of  the  slightly  damped  (exogeneous)  signal  generator  are  known,  but  the  wind  gust  may  be 
modeled  as  a  dirac  delta  function  which  forces  the  exogeneous  system  into  an  unknown  initial 
state.  The  control  objective  is  to  track  a  prescribed  trajectory  while  suppressing  the  disturbance 
produced  by  the  exogenous  system.  This  is  a  slight  modification  of  the  standard  problem  of 
output  regulation,  in  which  the  reference  trajectory  itself  is  also  assumed  to  be  generated  by  an 
exosystem.  As  we  shall  show,  this  modification  is  easily  accommodated  within  the  present 
framework  of  output  regulation.  However,  since  output  regulation  ensures  some  asymptotically 
stability  of  the  error  while  the  tracking  and  disturbance  rejection  problem  requires  a  finite  time 
horizon  in  practice,  we  have  assumed  the  exosystem  is  undamped  and  consists  of  a  finite  bank  of 
harmonic  oscillators. 

As  part  of  an  ongoing  research  effort,  we  are  developing  a  solution  to  the  problem  for  a 
nonlinear  system  which  incorporates  both  the  rigid  body  dynamics  and  aeroelastic  dynamics.  In 
this  section,  we  illustrate  our  use  of  the  internal  model  principle[8]  to  solve  this  problem  for 
continuous  time  linear  systems.  Specifically,  in  Section  3.2  we  introduce  the  basics  of  internal 
model  based  design  for  a  scalar-input,  scalar-output  system.  In  Section  3.3,  we  illustrate  this 
design  for  two  particular  systems  where  the  desired  trajectory  idealizes  a  take-off  and  landing 
maneuver.  In  Section  3.4  we  give  some  brief  remarks  about  the  internal  model  principle.  One 
remarkable  feature  of  this  principle  is  its  ability  to  systematically  produce  classical  design 
approaches  in  a  variety  of  particular  control  problems.  In  the  case  at  hand,  it  produces  a  notch 
filter  for  the  transfer  ftmction  from  the  disturbance  to  the  tracking  error,  with  notches,  or 
(blocking)  zeroes,  at  the  natural  frequencies  of  the  exogenous  system.  Of  course,  the  real  power 
of  this  method  is  its  ability  to  produce  nonclassical,  multivariable  designs.  Indeed,  in  Section  3.4 
we  consider  three-input,  three-output,  minimum  phase  systems  with  vector  relative  degree  (2;  2; 
2).  Interestingly,  the  internal  model  based  controller  design  leads  to  a  Linear  Matrix  Inequlaity 
(LMI)  constraint  on  the  design  parameters,  yielding  a  convex  problem  which  is  easily  solved. 
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3.1  The  Case  of  a  SISO  System 

Consider  a  linear  single-input  single-output  system  having  relative  degree  two  and 
asymptotically  stable  zero  dynamics.  Any  system  of  this  kind  can  always  be  put,  after  a  suitable 
change  of  coordinates  in  the  state  space,  in  the  form 

z  =  Az  +  B1X1  +  B2X2 

11  =  X2 

12  =  Cz +D1X1  A  1^2x2  +  Ku 

y  =  (3.1) 

in  which  ^  0  and  the  matrix  A  is  Hurwitz. 

It  is  well-known  that  single-input  single-output  systems  having  relative  degree  two  and  stable 
zero  dynamics  can  always  be  asymptotically  stabilized  by  means  of  a  proportional-derivative 
output  feedback,  i.e.  by  means  of  a  control  law  of  the  form 

u=kxy  +  k2y .  | 

It  is  also  well-known  that  a  proportional-derivative  output  feedback  can  be  used  to  solve  the 
problem  of  having  the  output  _y(t)  to  track  a  prescribed  trajectory  yrejiO,  if  the  latter  and  its  first 
and  second  derivatives  are  available  in  real-time. 

In  this  section,  we  describe  how  the  problem  of  tracking  a  prescribed  trajectory  yref  (0  can  be 
solved,  for  a  system  of  the  form  (3.1),  in  case  the  information  about  the  output  y  (i.e.  xi)  is  noise- 
free,  but  the  information  about  the  rate  of  change  y  (i.e.  ^2)  is  corrupted  by  harmonic  noise,  i.e.  if 
only  x\  and  the  quantity 

^2, noisy  ~  X2  Ad,  2^ 

where  d  is  a  superposition  of  sinuosoidal  functions  of  time  that  are  available  for  feedback. 

Define 


ei  =  y-Vtd  = 

€2  =  y  -  Vtet  =  2:2  -  yref 

in  which  ei  is  indeed  the  tracking  error  and  e2,  its  rate  of  change.  Then,  system  (3.1)  can  be 
rewritten  as 

z  =  Az  + Bi{ei  +  yt^t)  + B2(e2  +  yt^t) 

61  =  62 

62  =  Cz  A  Dl{ei  +  Vref)  -f  D2 (^2  A  y ret)  A  Ku  —  j/ief  . 


Let  Zrcf  be  any  solution  of 


and  define 


^ref  —  Azi^f  A  Biyi^(A  B2yiet 


(3.3) 

(3.4) 


6^  =  2  -  Z,^i  . 
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Then  system  (3.3)  can  be  rewritten  as 

=  Ae,  +  Biei  +  8262 
ei  =  €2 

62  =  Ccf Diici  +  D2{t2  Vrci) Ku  -  Vtti  ■ 


Choose  now  a  control  of  the  form 

^  ~  ^(3^1  >  ^2,noisy  >  t/ref)  yref>  y'ref) -^ret) 

—  ‘j^\  (J^i  “f  ^OVref  -^Syref  “b  j/ref  -^1(^1  Vref)  ^2(^2, noisy  t/ret)  ”b 


in  which  v  is  an  additional  input  to  be  determined  later.  With  this  control  u,  system  (3.5) 
becomes 


where 


+  SiCi  +  £2^2 

€1  =  62 

62  =  Cej^- kiCi- k2e2- k2d+v  , 


(3.6) 


k2  —  k2  —  D2  . 


It  is  known  that  d(t),  which  is  superposition  of  sinusoidal  functions,  can  be  put  in  the  form 

d{t)  — 


with  w(t)  solution  of  the  homogeneous  equation 


in  which  S  is  a  matrix  of  the  form 


and 


w{t)  =  S'ty(t)  1 

;  0 

i 

1 

0 

0 

Co 

to 

0 

\o  0 

sj 

.  /  0 

) 

(3.7) 


Under  our  assumption  that  the  zero  dynamics  is  either  vacuous  or  is  asymptotically  stable, 
we  know  that  any  transmission  zeros  of  the  plant  to  be  controlled  lie  in  the  left  half  plane.  Since 
the  eigenvalues  of  the  exosystem  (3.7)  that  generate  the  disturbance  lie  on  the  imaginary  axis,  we 
see  that  the  standard  nonresonance  conditions  for  solution  of  the  regular  problem  are  satisfied. 

Matters  being  so,  the  output  regulation  problem  for  y ref  =  0  is  solvable,  when  the  additional 
input  V  is  chosen  as  the  output  of  an  internal  model  of  the  form 

i  =  Si-Ghe 

V  = 
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where  G  is  a  vector  of  free  design  parameters.  We  claim  that  this  also  yields  a  solution  for 
nontrivial  yref.  Indeed,  introduction  of  v  yields  a  closed  loop  system  which,  changing  the 
variable  ^  into 

X  =  ?  -  hw 

is  described  by  set  of  equations 

=  Aejf  +  B\ei  +  B'2e'i 
X  =  Sx-Gkiei 
—  ^2 

62  =  Ge^  +  -  he2  . 

At  this  point,  it  remains  to  show  that,  if  kj,  k2 (or,  what  is  the  same,  k2),  and  G  is 
appropriately  chosen,  system  (3.8)  is  asymptotically  stable.  If  this  is  the  case,  in  fact,  we  have  in 
particular 


lim  ei{t)  =  0  , 

which  is  the  required  tracking  goal,  regardless  of  the  fact  that  the  measurement  ofy  is  corrupted 
by  noise. 

To  provide  our  stability  result,  set 

...  -&J) 

and 

=  •  G'^) 

where  the  partitions  indicated  are  consistent  with  those  of  S. 

Then,  the  following  result  holds. 

Proposition  3.1:  Consider  system  (3.8).  Choose  Gi,. . .  ,Gr  in  such  a  way  that 

<i>TST^G,>0,  i  =  l,...,r 

j2^fSr^G,<l 

^=1  (3.10) 

which  is  always  possible.  Ifki  >  0,  k2>  0  are  sufficiently  large,  the  system  is  asymptotically 
stable. 

The  proof  of  this  Proposition  is  a  straightforward  consequence  of  the  following  Lemma, 
proven  for  instance  in  [7]. 
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Lemma  3.2:  Consider  the  matrix 


(A 

0 

•  0 

0 

0 

•  0 

0 

Pi 

J  = 

0 

0 

• 

0 

Pr 

0 

0 

•  0 

0 

Fo 

Qi 

•  Qr 

Qo 

-k) 

PiQi 

<0 

for  all  i 

=  0,. 

..,r 

there  exists  k*  such  that,  ifk>  k*,  the  matrix  J  is  Hurwitz. 

To  provide  the  Proposition,  we  change  coordinates  in  system  (3.8)  using 

"f" 

X  =  x-kiS-^Oe, 


(3.11) 

(3.12) 


which  yields 

4  =  +  ^2)62 

^  =  Sx-hS-^Ge, 

=  ^2 

62  =  ae^  +  ^f^x-(OA-^B^+kl(l-^'^S-^G))ei-^lc2e2  , 

i.e.  a  system  of  the  form 

x  =  Jx 

with  J  a  matrix  having  the  same  structure  as  the  matrix  J  in  the  Lemma  if  we  set 
B  =  (A-IB1+S2) 

Qi  =  i  =  h...,r 

Pi  -  -kiSr^Gi  i  = 

Qo  =  -(CA-^Bi  +  h(l-i!^S-^G)) 

Fo  =  1  . 

k  =  k2 


Thus,  conditions  QBi  <  0,  for  /  =  1, . . .,  r,  of  the  Lemma  become 

ki<i!fS-^Gi>0 

while  conditions  QoPo  <  0  of  the  Lemma  becomes 

r- 

CA-^Bi  +  ki{l  -  ^^S-^G)  =  CA-^Bi  +  Ai(l  -  iffSr^Gi)  >  0  . 

Indeed,  if  G  is  such  that  (3.9)  and  (3.10)  hold,  the  conditions  of  the  Lemma  can  be  met  for 
large  A:i  >  0  and,  consequently,  for  large  ki  >  0  the  matrix  J  is  Hurwitz. 
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3.2  A  Numerical  Example  of  Take-Off  and  Landing 

In  this  section  we  consider  some  specific  numerical  examples.  In  our  first  example  we 
consider  the  ease  in  which  there  is  no  z,  i.e.  no  zero  dynamics,  and  that  £>i  =  £>2  -  0,  =  1  so  that 

the  plant  reduces  to  the  double  integrator.  For  the  second  example  we  consider  a  one 
dimensional  stable  zero  dynamics. 


For  both  examples  we  assume  that  disturbanee  d  is  given  as  a  sum  of  sinusoids  with  two 
different  frequencies, 


d{t)  =  Ml  sm(a'it  +  ^i)  +  M2  sin(Q;2t  + 1^2)- 


In  this  ease  we  ean 

5'i  = 


0  2" 
-2  0/  ’ 


•S'2 


0  10 

-10  0 


s  = 


Si  0 
0  ^2 


and  consider 

U!  =  Sw,  w(0)  —  Wq: 

Then  we  chose 

Example  3.3:  In  our  numerical  examples  we  have  chosen  “1  =  2,  q;2  =  10 

'^T  =  (o  2) 

25 
0 

\5oy 

so  that 


^'r=(o  2), 

1i)0  = 


d{t)  =  50cos(2f)  +  100  cos(lOt). 

Next  we  choose  Gu  G2,  ki  and  k2  so  that  the  eonditions  of  Proposition  3.1  are  satisfied. 
In  the  numerical  example  we  have  set  A:i  =  10  and 


which  gives 

=  0.2,  ^2S2^G2  =  0.2 

and 

so  the  conditions  of  Proposition  3.1  are  fulfilled. 
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We  proceed  to  chose  A:2  >  0  so  that  the  matrix  J  in  (3. 1 1 )  is  Hurwitz.  Our  choice  of  k2  will  be 
based  on  a  root  locus  design.  Since  we  do  not  have  a  z  component  in  this  example  the  matrix  J 
can  be  reduced  by  deleting  the  first  column  and  the  first  row.  Then  we  setk2  =  k=0  and  define 
the  resulting  matrix  to  be  Jo,  i.e., 


0 

0 

Jo  = 

0 

0 

Pr 

0 

.  0 

0 

-Po 

•  Qr 

Qo 

0/ 

Then  we  define  matrices 

=  (0  0  0  0  0  1) ,  e  =  (0  0  0  0  o  i) 

and  consider  the  system  {Jq,B,C)  with  feedback  law  u  =  -k2y  and  transfer  function 

e(sJ- 


Notice  that 

j  =  Jo-fe2®e,  I 

and,  more  importantly,  the  eigenvalues  of  J  are  the  closed  loop  poles  of  this  system.  If  k|  >0  our 
Proposition  predicts  that  for  large  values  of  k2  the  matrix  J  is  Hurwitz.  For  ki=  10  we  plot  the 
locus  of  the  roots  of  the  characteristic  polynomial  of  J,  viewing  k2  as  a  gain  parameter.  For  large 
k2,  five  of  the  6  roots  should  approach  five  (zeros)  on  the  imaginary  axis,  at  0,  ±2i,  ±10i,  while 
the  6-th  one  goes  to  -  oo. 


<x=2.  CL  =  V  ,  3U,  K2=  y.ttb 


Figure  3-1.  Plot  of  Root  Locus 
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In  this  example  we  take  a  simplified  model  of  aircraft  take-off  and  landing  with  a  reference 
signal  given  by 

{2t,  0<t<50 

100,  50  <  t  <  100  . 

300  -  2t,  100  <  t  <  150 


(left)  plot  of  y^f 

Figure  3-2.  yrefandx2, 


time 


(right)  plot  of  X2.n  =  X2  +  d 
in  Example  3.3 


Notice  thatyrcf,  is  not  differentiable  at  t  =  50  and  t  =  100  as  it  would  actually  be  in  practice. 
Indeed  we  have  not  actually  usedyrcf  as  given  above  but  rather  we  have  used  a  Hermite 
interpolation  to  round  the  comers  at  these  points.  Since  it  serves  no  particular  good  to  produce 
these  more  complicated  formulas  we  do  not  present  them  here. 


(left)  plot  of  ei“’  =  xi“*  -  /re/*’  (right)  plot  of  Xi  and  /ref 

Figure  3-3.  ei,  Xi,  and  yref  in  Example  3.3 
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Example  3.4:  In  our  second  numerical  example  consider  a  system  with  a  one  dimensional  stable 
zero  dynamics.  In  this  case  we  have  set  ^4  =  -I,  C  =  1,  Z)i  =  1  and  £>2  =  1 .  The  relative  values  of 
B]  and  Bz  play  a  large  role  in  the  location  of  the  closed  loop  poles  so  we  present  a  “root  locus 
plot”  in  two  different  cases:  B\  =  1/4,  B2  —  1/2  and  B]  =  1/2,  B2  =  1/4.  Also  for  this  example  we 
compute  the  reference  zero  dynamics  trajectory  Zrcf  satisfying  (3.4)  for  a  given  initial  condition 
2rcf(0)  ==  5.  Once  again  we  have  set  a]  =  2,  02-  \0,  and  we  have  taken 

^^=(l  1): 

25 
0 

Vloy 

d{t)  =  25V5sin(2t  +  7r/4)  +  10%/5sin(10t  +  7r/4). 

Next  we  chose  G\,  G2,  ki  and  ^2  so  that  the  conditions  of  Proposition  3.1  are  satisfied. 

In  the  numerical  example  we  have  set  k]=  20  and 


^r=(i  i)> 


Wo  = 


so  that 


which  gives 
and 


-  .4  <  1  =  0.2,  -  0.2 

so  the  conditions  of  Proposition  3.1  are  fulfilled. 

Just  as  in  the  previous  example  we  choose  A:2  >  0  so  that  the  matrix  J  in  (3. 1 1)  is  Hurwitz.  In 
the  present  case  more  care  must  be  taken  in  making  our  choice  for  k2  as  is  seen  in  the  root  locus 
plots  in  Figure  3-4.  Just  as  in  Example  3. 1 ,  if  we  first  set  ^2  =  A:  =  0  in  J  and  define  the  resulting 
matrix  to  be  JO,  i.e., 


0  • 

0 

0 

0 

• 

0 

0 

Pi 

Jo  = 

0 

0  ■ 

0 

Pr 

0 

0  • 

0 

0 

Po 

Qi  • 

Qo 

0/ 

Then  we  define  matrices 

0 

II 

€0 

0  0  0 

0 

0  1), 

e  = 

(0 

0  0 

0001) 

and  consider  the  system  {Jo,  B,  Q  with  feedback  law  u 

= . 

k2y  and  transfer  function 

Q{sl  - 
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Notice  that 


J  =  3o-k2'Be, 

and,  more  importantly,  the  eigenvalues  of  J  are  the  closed  loop  poles  of  this  system. 


(Left)  B  =  +  Bz)  >  0,  ki  =  15 


10 

6 

0 

-5 

-10 

-3.5  -3  -2.5  -2  -1.5  -1  -0.5  0 

(Right)  =  (A'^Si  +  82)  <  0,  ki  =  15 


1 - 1 - 1 - 1 - 1 - T - 

i . ' . \  . . > . ! . -i . 


Figure  3-4.  Root  Locus  in  Example  3.4 


Unlike  the  first  example  more  care  must  be  exercised  in  choosing  k\  and  kz.  As  an  example, 
if  we  choose  k\  >  0,  wet  -1,  C  =  1,  then  the  sign  of  the  real  parts  of  the  eigenvalues  of  Jo 
depend  on  whether  B  =  +  Bz)  is  greater  than  or  less  than  zero. 

If  iti  >  0  our  Proposition  still  predicts  that  for  large  values  of  kz  the  matrix  J  is  Hurwitz.  For 
it]  =  15  and  for  B\  =  1/4,  Bz  =  1/2,  B=  {A'^B\  +Bz)>0  for  which  the  some  of  the  open  loop  poles 
are  in  the  right  halfplane  while  for  ki  =  15  and  for  B\  -  1/2,  Bz  -  1/4,  B  =  {A'^Bx+B'^  <  0  all  open 
loop  poles  are  in  the  left  half  plane  for  large  kz,  five  of  the  7  closed  loop  poles  approach  the  five 
(zeros)  on  the  imaginary  axis,  at  0,  ±2/,  ±10/,  while  the  6-th  one  goes  to  -oo  and  the  7-th 
approaches  the  zero  at  -1. 

In  this  example  we  take  a  simplified  model  of  aircraft  take-off  and  landing  with  a  reference 
signal  given  by 


2L 


J/t6f(t)  = 


<100. 

[soo  -  2t, 


0  <  t  <  50 
50  <  t  <  100  . 
100  <  t  <  150 
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Figure  3-5.  X2,nand  ei  in  Example  3.4 


(Left)  Xi  and  /ref  (Right)  z 

Figure  3-6.  Xi,  Vref,  and  z  in  Example  3.4 


3.3  Remarks  on  the  Internal  Model  Principle 

One  of  the  remarkable  features  of  the  Internal  Model  Principle  is  its  ability  to  generate,  in  a 
systematic  fashion,  classical  control  designs.  More  importantly,  it  can  also  be  used  in 
multivariable,  nonlinear  and  infinite-dimensional  contents. 

As  a  first  illustration  of  this  point,  consider  a  relative  degree  1,  minimum  phase  scalar- input 
scalar-output  system 

z  =  Az  +  By 
y  =  Cz  +  Dy  +  ku  \ 

vfhtxty noisy  =  J  with  d(t)  being  an  unknown  constant.  Setting 

e  =  y  -  yref 
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we  obtain  the  relative  degree  one  analog  of  (3.3)-(3.6).  In  this  case,  the  constant  disturbance 
takes  the  form 

d  -  vw 

where  w  is  the  state  of  the  exogenous  system 

=  0.  i 

This  generates  the  internal  model 

V  =  'tjje 

which  is  a  proportional-integral  (PI)  controller.  Stability  of  the  closed  loop  system 

=  Ae;,  +  Be 
X  =  -Ge 

e  =  Csj,  +tl)x-  ke 

is  guaranteed,  whenever  ipG  >  0  by  Lemma  3.2  (see  [7]). 

As  a  second  illustration,  consider  the  system  described  in  Example  3.1.  The  disturbance  is  a 
sum  of  two  sinusoids  having  frequencies  ai  =  2  and  a2=  10.  Figure  3-7  below  depicts  the  Bode 
magnitude  and  phase  plots  of  the  closed-loop  transfer  function  from  the  disturbance  to  the 
tracking  error,  obtained  by  using  an  internal  model  based  design.  The  magnitude  plot  is,  of 
course,  that  of  a  typical  notch  filter  having  notches  (or  transmission  zeroes)  at  precisely  2/  and 
10/,  reflecting  the  fact  that  the  filter  absorbs  sinusoids  at  those  frequencies. 

Bode  Diagrams 


From:  0(1) 


Frequency  (rad/sec) 

Figure  3-7.  Bode  Plots  of  the  System  in  Example  3.1 
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3.4  The  Case  of  a  MIMO  System 

In  the  following  we  briefly  sketch  how  the  analysis  developed  in  section  3  can  be  easily 
extended  to  deal  also  with  multi  inputs-multi  outputs  systems.  We  consider  the  class  of 
minimum-phase  linear  systems  with  three  inputs  and  three  outputs  with  vector  relative  degree  [2, 
2,  2],  As  is  well-known,  this  class  of  systems  can  always  be  put,  after  a  suitable  change  of 
coordinates,  in  the  normal  form 

z  = 

®i  = 

y  = 

where  IRr 

®1  =  (  2:11  X12  ®13  ®2  =  (  3:21  2^22  2:23  )^  €  R®  ,  . 

IR^  is  the  vector  of  control  inputs  and  IR^  of  controlled  outputs,  with  the  matrix  A  which  is 
Hurwitz  and  the  high  frequency  gain  matrix  IR^  x  IR^  which  is  non  singular.  Our  goal  is  to 
force  the  output  vector  y  to  asymptotically  track  a  vector  of  known  reference  signals. 

(  i’ref.i  yret,3 

with  the  design  of  a  proportional-derivative  output  feedback  control  law.  Similarly  to  section  4 
the  challenging  aspect  of  this  apparently  simple  tracking  problem  is  given  by  the  presence  of 
additive  noise  on  the  velocity  measures.  More  precisely  we  suppose,  as  above,  that  the  measure 
of  the  position  vector  x/  is  noise-free,  but  the  measured  velocity  vector  is  corrupted  by  additive 
harmonic  noise,  namely 

®2, noisy (0  =  ®2 (0  +  ^(0  d{t)  =  {  di{t)  da(t) 

where  the  signals  di,  i  =  1,2,3,  are  given  as  superposition  of  sinusoidal  functions  of  time.  In 
particular  the  signals  di(t),  i=  1, 2, 3,  are  thought  as 

di(t)  —  with  Wi^t)  =  Si‘Wi(t)  Wi  € 

where  Si  is  a  block-diagonal  square  matrix  of  dimension  2r,  which  describes  a  bank  of  r/ 
oscillators  at  different  frequencies. 

By  mimicking  the  design  methodology  of  the  SISO  case  we  define  by  Zrcf  any  solution  of 

"b  ”b  .^2^re£  : 

and  we  choose  the  preliminary  control  law 

«  =  (-CZtef-  -  i?2yref +  yref  +  '^)  ^ 

where  v  is  a  vector  of  residual  control  inputs.  Defining 

ei  =  y  -  yret  =  ®1  - 
^2  ^  y-  i/ret  =  ®2  -  Vtef 


Az  +  Bixi  +  B2X2 
®2 

Cz  +  A®!  +  B2X2  -b  Ku 

(3.14) 
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and  z  =  z  -  Zrcf,  simple  computations  show  that  system  (3.4)  under  the  feedback  (3.15),  in  the 
new  error  coordinates,  reads  as 

z  =  Az  + 6161+8262 

ei  =  62 

62  =  CZ  +  D262  +  V  (3.16) 

The  design  of  the  vector  of  inputs  v  able  to  asymptotically  stabilize  the  system  (3.16),  and 
hence  to  solve  the  tracking  problem  can  be  easily  carried  out  by  means  of  a  three-steps  procedure 
in  which,  at  each  step,  just  one  control  variable  v,  designed,  following  exactly  the  design 
methodology  illustrated  in  the  SISO  case. 

To  sketch  how  this  procedure  can  be  carried  out  consider,  for  the  first  control  input  v/,  the 
following  dynamic  law 

Vi  =  "^11(2^11  ~  ytef,l)  ■“  ^21  (2^21, noisy  “  ytef.l)  T 

=  —Alien  -  Ic2ie2i  + —  ^21 (3-17) 

in  which  ku,  k2i  and  the  entries  of  Gi  are  design  parameters.  Simple  computations  show  that, 
having  defined 

Xl  =  Cl  -  A2l2i)l  , 

system  (3.16)  with  the  partial  control  law  (3.17)  can  be  rewritten  as 

I  =  Az  + 8161  + 8262 
=  62 

I2  =  Cz  +  D2e2  +  v 


(3.18) 


where 


z  =  {  Z  Xl  «11  «21  )' 

the  matrix  A  is  defined  as 


A  = 


{  A 
0 
0 


0 

Si 

0 


Gi  —  ^ 

) 

V 

B'2 

\ 

-Giku 

0 

0 

1 

-All 

-(A21  - 

V  =  [  V2  V2  y 


(3.19) 


and  B[, B'^ , C', are  suitably  defined  matrices 


Now  it  can  be  easily  realized  that  it  is  possible  to  chose  k\\,k2\,  and  Gi  so  that  the  matrix  A 
is  Hurwitz.  To  this  end  just  note  that  the  matrix  characterizing  system  (3.8)  has  the  same 
structure  of  (3.19).  Hence,  by  Proposition  3.1,  we  can  claim  the  existence  of  a  vector  G|  such 
that  for  suitably  large  An  and  A22,  the  matrix  A  is  Hurwitz. 

From  this,  once  GI,  An  and  A22  have  been  fixed,  the  procedure  can  be  iterated  in  order  to 
design  V2  and  V3.  As  a  matter  of  act  the  system  (3.18)  exhibits  the  same  structure  of  the  original 
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error  system  (3.16),  with  the  state  variables  ,  an  z  which  replace  ei  and  z  ,  with  the  vector  of 

control  input  v  which  replaces  v,  and  with  the  Hurwitz  matrix  A  which  replaces  A.  This  allows 
us  to  design  also  va,  and  in  the  third  step  V3,  using  the  same  procedure  used  for  the  SISO  case  and 
come  out  with  a  proportional-derivative  stabilizer  of  the  form 

C  =  S^-GKiei 

V  =  -Kiei  -  K-^e^  + 'i’i  (3  20) 

with 


f  Si  0  0  \  f  kii  0  0  \  f  k2i  0 

5  =  0  0  =  0  kn  0  K2=  \  0  k22 

\  0  0  S2  J  \  0  0  ki2  J  V  0  0 


/  Gi  0  0  \  /’I'lOOX 

C?=  0(?2  0  ^=  0^2  0  . 

\  0  0  Gs  J  \  0  0  J 

Hence  the  overall  control  law  is  obtained  from  the  composition  of  (3.15)  and  (3.20) 
Example  3.5: 

In  our  3-dimensional  numerical  examples  we  have  chosen  ai  =  2, 02  =  3,  as  =  4, 

=  =  2), 


Wo 


25 

0 

w 


so  that 


d{t)  = 


5(sin(ait)  +  cos(o;it) 
10(sin(a2t)  +  cos{a2t) 
5(siii(a3<)  -|-  cos(a3t) 


Next,  we  choose  G\,  G2,  Go  k\  and  ki  so  that  the  conditions  of  Proposition  3.1  are  satisfied. 
We  have  also  set  D\=  D2  =  hxz. 

In  the  numerical  example  we  have  set  A:i  =  10  and 


which  gives 

<if'[S{^Gi  =  1/5,  ^2  32^^2  =  2/3,  '^zSf^Gz  =  1/10 
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and 

87 

+  ^2'>S^2  ^^2  +  'J'i' >^3  ^*^3  =  <  1 

SO  the  conditions  of  Proposition  3.1  are  fulfilled. 

From  the  theory  developed  in  Section  3.1  we  see  that  the  closed  loop  system  can  be  written 
as  X  =  JX  where 


X  = 


.«2 


and,  just  as  in  Example  3.3,  since  there  is  no  z  component,  the  matrix  J  can  be  reduced  by 
deleting  the  first  column  and  first  row,  to  obtain 


r  ^ 


j= 


03x6 


-QKi 

03x3 

-Kt 


Oexs 

hxs 

-(if2-A) 


Since  this  is  a  MIMO  example  we  carmot  directly  appeal  to  root  locus  methods  but  if  we  set 


=  k2hxs,  •and  set  k  =  {k2-  1) 


then  it  is  still  simple  to  compute  a  “root  locus  type  plot”  of  the  closed  loop  poles  as  functions  of 
the  scalar  parameter  A:  >  0. 

Namely,  we  define  the  matrix  to  be  Jo  by 


'  s 

-GKi 

06x3] 

Jo  = 

03x6 

'F 

03x3 

CO 

X  X 

CO  CO 

1 _ 

Then  we  define  matrices 

B'^  = 

(0ix9  1 

1  1) 

,  0-  = 

(0lx9 

so  that 

J  =  Jo  -  kBC 

and  consider  the  eigenvalues  of  J  as  the  closed  loop  poles  of  this  system. 

For  Ai  =  10  we  plot  the  locus  of  the  roots  of  the  characteristic  polynomial  of  J,  viewing  Aa  (or 
A)  as  a  gain  parameter. 
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8 
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4 
2 
0 
-2 

-6 
-8 

-4  -3  -2-1  0  1 

Figure  3-8.  Plot  of  Root  Locus 

In  this  example  we  track  a  three  dimensional  reference  trajectory 

=  [-lOoos  (M) 

where 

't,  0<t<40 

=  <  40  -H  5  sin  >  40  <  t  <  100 

-|(t-  150),  100  <t<150 
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4.0  ROBUST  ADAPTIVE  NONLINEAR  OUTPUT  REGULATION 

-  An  Application  To  UCAV  Flight  Control  - 

In  this  section,  we  discuss  an  application  of  Output  Regulation  methods  to  UCAV  flight 
control.  For  this  purpose,  Neural  Networks  and  Internal  Model  Principle  Nonlinear  Output 
Regulation  (IMPNOR)  methods  are  considered.  These  two  methods  are  applied  to  the  UCAV 
X-45A  linear  model  independently,  and  the  results  are  compared.  We  start  with  the  plant  model. 

4.1  Plant  Model 

Beginning  in  this  section,  adaptive  model  reference  augmentation  of  a  baseline  controller  is 
introduced  using  the  UCAV  X-45A  model,  as  an  example.  It  should  be  noted,  that  the  method  is 
not  restricted  to  this  particular  model,  and  that  it  is  applicable  to  any  plant  dynamics  that  is  taken 
to  be  of  the  form: 

^P  +^^(^+^0  (xj)  =  A^x^  +  BpA(S+  Xq  (Xp ))  (4.1) 

In  (4.1),  using  the  UCAV  X-45A  flight  dynamics  nomenclature, 

Xp={(X  P  P  q  rf  (4.2) 

is  the  f7p=5  -  dimensional  state  vector  that  consists  of  angle  of  attack  a ,  angle  of  sideslip  P , 
body  roll  rate  p ,  body  pitch  rate  q ,  and  body  yaw  rate  r .  The  vector  of  virtual  controls  S  is 

(mxl)  -  dimensional  with  m  =  3,  G  is  the  [Mxm)  -  control  allocation  matrix,  A  is  the 

(mxm)  -  unknown  constant  diasonal  matrix  with  positive  diagonal  elements,  and 

G  AS  =  [SpoB  ^UB  ^RIB  ^RMB  ^ROB  ^Tvec  )'  (4.3) 

is  the  M  =  7  -  dimensional  vector  of  actual  control  inputs,  (positions)  that  consists  of  6  elevons 
and  a  thrust  vectoring.  Also  in  (4.1),  is  a  (nx/?)  -  constant  known  matrix.  5  is  a  (n^xM^ 

-  constant  known  matrix.  isa  -  constant  known  matrix,  and  A^o^x^)  is  the  (mxl)  - 

unknown  state-dependent  vector.  Note  that  while  A  models  unknown  control  failures,  the 
vector-function  ^^0(^/7)  represents  matched  unknown  nonlinear  effects. 

Vertical  acceleration  a^  (positive  down)  is  computed  as  the  system  (4.1)  output  signal. 

a,  =  C,  X,  +  £G  A  (<y + ))  =  C,  X,  +  D,  A  (^ + a:,  (x, ))  (4.4) 

In  addition,  it  assumed  that  the  entire  system  state  x^  is  available  for  control  design 
purposes. 
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4.2  Baseline  Inner-Loop  Controller 

Dynamics  of  the  UCAV  X-45A  baseline  inner-loop  controller  is  defined  as  follows: 


where 

Xc={^,  Pi  n  fS 


(4.5) 

(4.6) 


is  the  n^=4  -  dimensional  controller  state  vector  that  consists  of  pitch  integrator  q, ,  roll 
integrator  p, ,  yaw  integrator  r, ,  and  yaw  rate  washout  filter  signal  .  The  vector  of  inner-loop 
commands 


Ocmd 


^cmd 


,xmd 


)’ 


(4.7) 


is  comprised  of  =4  commanded  signals:  vertical  acceleration  ,  sideslip  ,  roll  rate 

,  and  yaw  rate  .  Combining  (4.5)  and  (4.7)  with  (4. 1),  the  corresponding  augmented 
plant-controller  system  becomes: 


\^cj 


B, 


i  c 


0^ 


+ 


^P 

0 

s. 


A(<y+/c.(x,))- 


0 


B. 


2cJ 
«2 


or  equivalently 

x  =  Ax+B,K{S+K^[xp)]  +  B^u  (4.8) 

Feedback/feedforward  gains  for  the  UCAV  X-45A  baseline  (nominal)  inner-loop  controller 
are  designed  assuming  no  modeling  uncertainties,  A  =  /„,  ,  and  using  the  LQR 

method  with  projection  [16, 17,  18,  19,  20].  The  corresponding  inner-loop  control  system  takes 
the  form: 

S,=Klx^Klu  (4.9) 


where  and  denote  the  («xw)  and  [n^xm)  -  nominal  feedback  and  feedforward  gain 
matrices,  correspondingly.  The  number  of  virtual  controls  is  m  =  3  and  «  =  represents 

the  order  of  the  extended  system  (4.8). 

Nominal  inner-loop  feedback  (4.9),  when  applied  to  the  ideal  (i.e.,  without  uncertainties) 
model  (4.1),  naturally  yields  the  desired  (reference)  dynamics. 

V  =  ( 4  +  B,  Kl ) x,^  *(B,+B,Kl]u  =  A,^ x,„  +  B,„u  (4. 1 0) 

^  . .  . -  - 

^rcf  ^ref 
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Assumption  1:  Reference  model  matrix  is  Hurwitz,  (i.e.,  stable). 

The  inner-loop  control  objective  is  to  design  state  feedback  control  S  for  (4.8)  such  that 
despite  the  system  uncertainties  A  and  ATp  ,  all  closed- loop  signals  are  bounded  and  the 

system  state  x  tracks  the  reference  model  state  . 

To  this  end,  it  should  be  noted  that  the  reference  model  (4.10)  was  written  using  the  baseline 
feedback  gains  designed  for  the  nominal  (i.e.,  no  uncertainties)  X-45A  model.  In  order  to 
achieve  the  inner-loop  control  objective  for  the  system  (4.8)  in  the  presence  of  modeling 
uncertainties  and  unknown  control  failures,  a  direct  adaptive  model  following  control 
architecture  will  be  designed. 

4.3  Inner-Loop  Tracking  Error  Dynamics 

4.3.1  Internal  Model-Based  Controller 

Now  define  the  tracking  error  to  be  regulated  as 

e  =  x-xr^f 

and  seek  an  observable  (in  the  first  approximation)  controller  with  state  and  output  K 
of  the  form 

^  =1 

d  =  K  (#,e) 

such  that  lim  e{t)  =  0  and  ^  remains  bounded.  From  Eqs.  (4.8)  and  (4.10) 

t—^oo 

e  =  Ao(A)e  +  (A)x,,^^  -i-  5,  A[^  +  (e  -f-  x„^ )]  + 

^  ~  ^  ^cmd  ) 

S  =K  {^,e) 

whereAo(A)=A+Bi  AK'^x,  Bo(A)=  B,  (A-I)  K\,  Bu(A)=  B,  (A-I)  K\,  and  the  control 
d  =  k{^,  e)  is  chosen  such  that  the  interconnected  dynamics  are  Hurwitz  in  the  first 
approximation.  In  addition,  the  control  scheme  must  ensure  that  the  nonlinear  pitch  break 
dynamics  do  not  perturb  the  system  away  from  some  bounded  set  near  e  =  0  for  failures  under 
consideration.  The  choice  of  internal  models  is 

=  r,.^,  +  A„.e  -1-  sgn(e,  )e, ^  i  =  1,2,3 

S  =  [S,  S,  Sj 

where  the  nonlinear  conditioning  term  is  chosen  so  as  to  bound  pitch-break  dynamics  based  on 
the  knowledge  that  the  pitch-break  phenomenon  is  quadratic  in  angle  of  attack.  The  gain-matrix 
entries  on  the  nonlinear  error  feedback  terms  are  chosen  large  enough  to  bound  the  pitch-break 
dynamics,  but  not  so  large  that  excessive  control  effort  is  produced.  The  form  shown 

K2iSgn(e^)e^^ 
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gives  a  ‘cubic’  stability  benefit  with  a  ‘quadratic-in-the-error’  time-varying  magnitude.  The 
linear  error-feedback  gain-matrices  K|i  and  Ni  are  chosen  to  make  Ao  Hurwitz  and  to  make  the 
domain  of  attraction  of  the  error  large  enough  to  bound  coupled  error-reference  terms  that  are 
introduced  by  the  quadratic  pitch-break  dynamics.  The  role  of  f i^i  is  to  approximate  the  residual 
purely  reference  terms.  This  controller  gives  bounded  error  tracking  for  the  closed  inner-loop 
dynamics  for  the  failure  cases  considered.  The  outer-loop  problem  was  not  considered,  although 
the  inner-loop  control  is  sufficient  to  keep  the  outer-loop  dynamics  well-contained  for  inner-loop 
commands. 

The  size  of  the  neighborhood  can  be  determined  via  Lyapunov  analysis.  Note  that  this 
controller  yields  approximation  regulation  because  the  pitch-break  phenomenon  introduces  a 
term  that  is  proportional  to  x^rcf  that  is  not  directly  accounted  for  in  the  design  of  the  IMPNOR 
scheme.  However,  this  term  does  not  adversely  affect  the  steady-state  solution,  and, 
consequently,  was  not  taken  into  account.  Although,  it  is  possible  to  realize  a  higher  order 
IMPNOR  scheme  that  contains  a  copy  of  the  dynamics  associated  with  this  quadratic  reference 
signal.  Simulation  results  are  provided  in  Section  4.6.2  for  the  Inner  Loop  and  in  Section  4.9.2 
for  the  Outer  Loop. 

4.3.2  Adaptive  Control  Set-up 

Inner-loop  tracking  performance  can  be  quantified  by  the  corresponding  tracking  error 
signal; 

e  =  x-x^^f  (4.11) 

Similar  to  [13,  14,  15],  direct  adaptive  control  input  is  formed: 

S  =  klx  +  klu-k,(x^)  (4.12) 


In  (4.12),  k^,  k^,  and  ^o(^p)  represent  the  adaptive  feedback/feedforward  matrix  gains 
that  will  be  determined  through  the  Lyapunov  stability  analysis. 

Using  (4.12),  the  corresponding  closed-loop  system  dynamics  becomes: 

x  =  Ax  +  B^Al^kl  x  +  kl  +  w 

=  [A  +  B,Akl)  x+[B,  +  B,hkl)u  +  B,K[-k,(x^)  +  K,(x^)) 

V _  _ /  V _  _ / 

A  B 

Comparing  system  (4.13)  with  the  reference  model  dynamics  (4.10),  the  following 
assumptions  are  stated. 

Assumption  2:  For  a  constant  unknown  diagonal  matrix  A  and  a  constant  control  allocation 
matrix  G  ,  there  exist  "true"  gains  AT*,  AT*  such  that  the  so-called  uncertainty  matching 
conditions  take  place; 


4„  =  .4+s,A(/:;f 

■< 


(4.14) 
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Remark  1:  Knowledge  of  the  true  parameters  Kl,  Kl  will  not  be  required  for  adaptive  control 
design.  Only  their  existence  is  assumed. 

Inner-loop  tracking  error  dynamics  is  obtained  by  subtracting  (4.10)  from  (4.13). 

r  \ 


e  =  +  m-5,A 


(4.15) 


=  4,/  ^  +  {A-Aef)x+[B-B^^f)u-B,KM:^(xp) 


Using  uncertainty-matching  conditions  (4.14),  the  tracking  error  dynamics  becomes: 


e=4.^ye+5,  A 


(k,-k:)"  x*(k,-k:]\-ak,(x,) 


AK, 


AK„ 


(4.16) 


=  4,^  e  +  5,  A  ( AA:J  X -I- AATj  w  -  AAo  (Xp )) 

To  this  end,  we  approximate  the  uncertainty  (4)  using  multi-input-multi-output 
feedforward  neural  network  (NN)  with  radial  basis  function  (RBF)  neurons  in  its  inner  layer, 
[22,  23].  The  network  computes  m  linear  combinations  of  a  suitably  chosen  set  of  radial  basis 

functions  ^(pj  (4)}  [24]. 


^  N, 


A(^)= 


0  ^ 

y=i 


0  ^ 


V>=‘ 


=  0^<I>(x,) 


(4.17) 


The  RBF  NN  Universal  Approximation  Theorem  [21]  states  that  given  an  approximation 
tolerance  £^>0,  and  a  compact  set  X  <zR" ,  there  must  exist  an  integer  and  a  "true" 

constant  matrix  0  e  such  that  for  all  x^e  X  cz  R" : 

K,{x,)  =  e^0(x,)*e,{x,)  (4.18) 


and 


(4.19) 


In  other  words,  given  enough  neurons,  (i.e.,  RBF  fiinctions),  one  can  approximate  a  nonlinear 
function  to  within  any  accuracy. 
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Using  neural  approximation  (4. 1 8),  we  get 

AA:. (x, )  =  )-/:,( jr,)  =  (e-©f  <l>(jr,)-£.(Ar,)  (4.20) 

'  A0  ' 

In  (4.20),  A0  =  0-0,  and  fo  (^^ )  >  represent  unknown  parameter  estimation  errors,  and 
uncertainty  approximation  errors,  correspondingly.  Substituting  (4.20)  into  (4.16)  yields: 

e  =  A^^^e  +  B,A(AKl  x+AKl  u-Ae^  ^x^)  +  e,(x^))  (4.21) 

or,  equivalently 

f  \ 

f  \ 

X 

e  =  A^e+B,A  (aATJ  AK]  AB")  u  +£t{x,)  (4.22) 

4'(x,  u)  J 

Hence,  the  inner-loop  tracking  dynamics  can  be  written  in  the  form: 

e  =  e+Bi  a(aQ^  4^  (x,m)  +  )]  (4.23) 

4.4  Inner-loop  Stability  and  Adaptation 

Without  a  loss  of  generality,  it  is  assumed  that  the  uncertain  matrix  A  has  strictly  positive 
diagonal  elements.  To  this  end,  consider  the  following  positive  definite  Lyapunov  function 
candidate: 

V (e, Af2)  =  e^  Pe-Vtrace[A£f^  F”'  AQ.  A)  (4.24) 

where  trace  of  a  matrix  is  defined  as  the  sum  of  its  diagonal  elements.  Given  a  symmetric 
positive  definite  matrix  0 ,  let  P  denote  the  unique  positive  definite  symmetric  solution  of  the 
Lyapunov  equation: 

(4.25) 

Also  note  that  in  (4.24)  F  is  a  positive  definite  diagonal  symmetric  matrix. 

0  0  ^ 

F=  0  F„  0  (4.26) 

^0  0  Fq^ 
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Computing  time  derivative  of  the  Lyapunov  function  candidate  V  along  the  system  (4.23) 
trajectories,  we  get: 

V{e,AD.)  =  e^Pe  +  e^Pe  +  2  trace ( T"'  Q  A ) 

=  (4^e  +  5,A(AQ^'F(x,M)  +  fo(xJ))  Pe 

(h.z  / ) 

+e^  +  A^Afi^  '¥[x,u)  +  £q  (xp)jj  +  2/race(AQ’^  F"'  QAj 

=  -e^  Qe  +  2e^  PB^  A  AQ^  'F(x,M)  +  2/race^Af2^  F^'  i2A)  +  2e^  P5,  Af{,(xp) 

Applying  the  trace  identity  y  =  tracei^yx^  j  to  the  second  term  in  the  right  hand  side  of 
(4.27),  yields: 

F(e,  AQ)  =  V  Qe+2e^  P5,  Ke^  (x^) 

+2/race(AQ^  T(x,M)e^  PB,  h)  +  2trace[M[f  F”'  QA)  (4.28) 

=  -e^  Qe  +  2e^  P B^  Af(,  (xp)  +  2trace(AQ^  [x,u)e^  P By  +F”‘  Q^Aj 

Based  on  (4.28),  we  choose  adaptive  laws: 

i:2  =  FProj(Q,-4'(x,M)/P5,)  (4.29) 


where  Proj(-,  )  denotes  the  projection  operator,  [25].  It  ensures  that  the  matrix  of  adaptive 

parameters  Q  does  not  exceed  its  pre-specified  norm  bound  ,  and  at  the  same  time  negative 

semi-definiteness  of  the  Lyapunov  function  is  maintained.  Due  to  (4.29),  an  upper  bound  for  the 
time  derivative  of  the  Lyapunov  function  can  be  found. 


V(e,ACi)<-e'  Qe+2e’^ PB,Ae,{x^) 

=  -H(4i.(e)H-2||PS,|n>ax(A)£:) 


(4.30) 


Consequently,  the  Lyapunov  function  derivative  V  (e,  AQ)  becomes  semi-negative  outside 
of  the  compact  set: 


Z  = 


e  < 


2||P5,||max(A)fo 


(4.31) 


llDli<a 
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Thus,  adaptation  (4.29)  yields  bounded  tracking.  Moreover,  due  to  the  diagonal  form  of  T , 
the  adaptive  laws  can  explicitly  be  written  as: 

Proj(^,,-x/P5,) 

<k^=T„  Proj(^„,-M/P5,)  (4.32) 

0  =  re  Proj(e,<D(xJ/P^,) 


4.5  Adaptive  Augmentation  of  the  Baseline  Inner-Loop  Controller 

Since  the  structures  of  the  baseline  inner-loop  controller  (4.9)  and  of  the  adaptive  feedback 
(4.12)  are  similar,  the  former  could  be  imbedded  into  the  latter. 

s=SL(x,u)+kl  x+ kl  V- s^(x,u)- k„(x^) 

=  +[k.-Kd  x*[k.-Kd  u-^x^  (4.33) 

^  V  ^  . .  V 

nominal  adaptive 

In  other  words,  the  adaptive  system  augments  the  baseline  inner-loop  controller.  In  this  case, 
adaptation  of  the  incremental  feedback  gains  k,  k,  ©  starts  from  zero  initial  conditions  and 
has  the  following  dynamics; 


k=T^?xo][k,-xe^  PB,],  ^,(0)  =  0„ 

4=r„Proj(4,-«/P5,),  4(o)=o« 

4)  =  reProj(4<D{xJ/P5,),  0(O)  =  O^^,„ 


-'n'X.p 


(4.34) 


Note  that  in  (4.34),  the  projection  operator  depends  on  two  vector  arguments  and  is  defined 
column- wise: 


Proj(^,y)  = 


if{/(^)>0and/V/(^)>0} 


(4.35) 


[y,  if  not 

The  corresponding  convex  cost  function  and  its  gradient  are  given  below: 


f{y)= 


V/(y)  =  -y 


|2  /t-tL 


^3- 


(4.36) 
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In  (4.36)  denotes  the  norm  upper  bound  imposed  on  the  parameter  vector  y ,  while  e^. 

defines  the  projection  tolerance.  Detailed  description  of  the  projection  operator  is  given  in 
Appendix  A. 

Designed  to  yield  bounded  tracking,  the  adaptation  process  (4.34)  has  the  ability  to  not  only 
change/reconfigure  the  nominal  feedback  gains  and  ,  but  also  to  identify  and  counteract 

the  nonlinear  matched  uncertainty  Kq  )  in  the  system  (4.1).  As  a  result,  bounded  tracking  of 

the  reference  model  trajectories  is  achieved  in  spite  of  unknown  control  failures  and  unmodeled 
nonlinear  dynamical  effects.  The  corresponding  closed-inner-loop  system  block-diagram  is 
shown  Figure  4-1. 


Figure  4-1.  Closed-Inner-Loop  System  Block  Diagram 

To  this  end,  it  should  be  pointed  out  that  the  adaptive  system  (4.34)  was  derived  using  RBF 
NN  approximation  (4.18).  An  alternative  approach  that  uses  Single-Hidden-Layer  (SHL) 
feedforward  neural  networks  with  sigmoidal  activation  units  instead  of  the  radial  basis  functions 
is  presented  in  Appendix  B.  The  two  neural  networks,  RBF  NN  and  SHL  NN,  are 
complimentary  in  their  approximation  abilities  [21]. 

4.6  UCAV  X-45A  Inner-Loop  Design  and  Evaluation 

4.6.1  Neural  Networks-Based  Controller 

The  adaptive  augmentation  approach  illustrated  in  Figure  4-1  was  utilized  for  the  design  of 
the  UCAV  X-45A  adaptive  reconfigurable  inner-loop  controller.  For  the  design  purposes,  open- 
loop  X-45A  aircraft  data  (i.e.,  plant)  were  taken  to  represent  low  speed  dynamics  at  Mach  =  .3, 
Altitude  =  5K  ft,  and  angle  of  attack  AOA  =  6  degrees. 

Baseline  inner-loop  feedback  /  feedforward  gains  in  (4.9)  were  designed  using  the  LQR 
method  with  output  projection,  [17,  18,  19,  20,  25]. 

Using  relation  (4.10),  the  reference  model  matrices  and  were  computed.  It  was 

verified  that  the  reference  model  matrix  was  Hurwitz,  (i.e..  Assumption  1  was  satisfied). 

While  the  and  the  3"^^  components  of  the  modeling  uncertainty  )  were  set  to  zero, 

its  2"'*  component  was  chosen  to  represent  the  so-called  “Pitch  Break  phenomenon”  that 
depended  solely  on  angle  of  attack,  (AOA).  Figure  4-2  shows  the  corresponding  data. 


Robust  Adaptive 
Control  of  UCAVs 


34 


GP49152 


Robust  Adaptive  Final  Report 

Control  of  UCAVs _ boeing  -  stl  2004P0037 


Figure  4-2.  Pitch  Break  vs.  AOA  data 


Radial  basis  functions  were  chosen  in  the  form  of  Gaussians: 


f 

iPj(xp)  =  exp  ■ 


[a -ajf  + 


V  y 

AOA  break  points  ^  were  spaced  evenly  between  -5  and  5  degrees,  and  half  a 

degree  apart  from  each  other.  The  Gaussian  widths  were  set  to  tr  =  0.5 .  The  latter  provided  for 
a  reasonable  overlap  between  the  individual  basis  functions.  Also,  a  unity  basis  function 


(P22  )  =  1  was  added  to  the  RBF  NN  in  (4. 17),  [N^  -  22) . 


The  adaptation  rates  in  (4.32)  were  chosen  to  be: 

'r^=100diag([0  0  1  1  1  0  0  0  0]) 
.r„=0 

re=I00eye(Wo) 


where  “diag”  and  “eye”  denoted  diagonal  and  identity  matrices,  correspondingly.  Note,  that  in 
general  the  Lyapunov  function  in  (4.24)  requires  nonsingular  matrices  of  adaptation  rates. 
However,  since  the  inner-loop  dynamics  primarily  depends  on  body  angular  rates,  the  rates  of 
adaptation  chosen  above  are  adequate,  and  the  rest  of  the  system  dynamics  can  be  treated  as  a 
modeling  uncertainty  that  is  approximated  and  cancelled  on-line  by  the  RBF  NN. 

Positive  semi-definite  solution  of  the  Lyapunov  equation  (4.25)  was  obtained  by  choosing 
0  =  diag([O  0  2.5  2.5  2.5  0  0  0  O]) 
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In  order  to  constrain  the  inner-loop  adaptation  parameter  values  in  (4.34),  column-wise  norm 
upper  bounds  in  (4.36)  were  set  to  unity,  and  projection  tolerances  £^.  were  chosen  to  be 

10%  of  the  norm  upper  bounds. 

Closed-inner-loop  system  performance  was  evaluated  in  the  UCAV  X-45A  flight  simulation 
environment  that  included  1®'  order  actuator  models  with  a  time  constant  as  well  as  CrowMix  - 
the  X-45A  nonlinear  control  allocation  logic.  Various  control  failure  scenarios  with  and  without 
the  pitch  break  phenomena  were  tested.  Simulated  performance  with  adaptation  was  compared 
to  the  performance  provided  by  the  baseline  inner-loop  system  without  the  adaptation.  The 
simulation  evaluation  results  are  shown  in  Figures  4-3,  4-4,  and  4-5.  For  comparison  purposes, 
simulation  data  were  obtained  from  the  following  three  closed-inner-loop  systems:  a)  Adaptation 
OFF  /  Failures  OFF  (blue),  b)  Adaptation  OFF  /  Failures  ON  (red),  and  c)  Adaptation  ON  / 
Failures  ON  (black). 

Figures  4-3a,  4-3b,  and  4-3c  demonstrate  adaptation  benefits  when  the  right  outboard  (ROB) 
elevon  fails  at  1  second  into  the  maneuver.  Figure  4-3a  indicates  that  in  spite  of  the  unknown 
control  failure,  the  adaptive  system  is  able  to  quickly  reconfigure  and  track  the  commanded 
vertical  acceleration,  sideslip  angle,  roll  rate,  and  yaw  rate  signals,  simultaneously.  In  fact, 

Figure  4-3a  shows  that  with  the  adaptation  turned  on  the  desired/nominal  system  tracking 
behavior  has  been  almost  recovered.  In  addition.  Figure  4-3b  compares  the  three  virtual  control 
feedback  signals,  and  Figure  4-3c  confirms  that  the  level  of  control  activity  is  reasonable  and  that 
no  control  saturation  has  occurred  during  the  adaptation  process. 


Time,  sec 


Figure  4-3a.  Inner-Loop  Adaptation  with  ROB  Elevon  Failure:  Command  Tracking 
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iure  4-3b.  Inner-Loop  Adaptation  with  ROB  Eievon  Failure:  Virtual  Controls 


Time,  sec 


Figure  4-3c.  Inner-Loop  Adaptation  with  ROB  Eievon  Failure:  Controls 
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Figures  4-4a,  4-4b,  and  4-4c  demonstrate  adaptation  benefits  when  in  addition  to  the  ROB 
elevon  failure  the  Pitch  Break  phenomenon  is  active  throughout  the  entire  maneuver.  As  seen 
from  the  data,  the  adaptive  process  removes  the  unwanted  oscillations  and  quickly  “learns”  to 
cancel  the  unknown  Pitch  Break  nonlinear  phenomena.  As  expected,  all  of  the  control  signals 
remain  well  within  their  position  and  rate  limits. 


Time,  sec 


Figure  4-4a.  Inner-Loop  Adaptation  with  ROB  Elevon  Failure  and 
Pitch  Break  Phenomenon:  Command  Tracking 
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Figure  4-4b.  Inner-Loop  Adaptation  with  ROB  Elevon  Failure  and 
Pitch  Break  Phenomenon:  Virtual  Controls 


Time,  sec 


Figure  4-4c.  Inner-Loop  Adaptation  with  ROB  Elevon  Failure  and 
Pitch  Break  Phenomenon:  Controls 
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Figures  4-5a,  4-5b,  and  4-5c  demonstrate  adaptation  benefits  when  the  inner-loop  system 
receives  repetitive  vertical  acceleration  commands  in  the  presence  of  both  the  ROB  elevon 
failure  and  the  unknown  Pitch  Break  phenomenon.  The  data  in  Figure  4-5a  show  that  while  the 
nominal  system  provides  consistent  (oscillatory)  performance,  the  adaptive  process  quickly 
dampens  the  oscillations  caused  by  the  uncertainties  and  significantly  reduces  the  corresponding 
control  activity. 
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Figure  4-5a.  Inner-Loop  Adaptation  with  Vertical  Acceleration  Commands,  ROB  Elevon  Failure, 
and  Pitch  Break  Phenomenon:  Command  Tracking 
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Figure  4-5b.  Inner-Loop  Adaptation  with  Vertical  Acceleration  Commands,  ROB  Elevon  Failure, 

and  Pitch  Break  Phenomenon:  Virtual  Controls 


Time,  sec 


Figure  4-5c.  Inner-Loop  Adaptation  with  Vertical  Acceleration  Commands,  ROB  Elevon  Failure, 

and  Pitch  Break  Phenomenon:  Controls 
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Adaptive  inner-loop  system  performance  was  also  tested  in  the  UCAV  X-45A  high  fidelity 
nonlinear  6-DoF  flight  simulation  environment,  (MATRIXx).  Figure  4-6  compares  performance 
of  the  three  systems:  a)  Baseline/No  Failures  (black),  b)  Baseline/Failures  ON  (red),  and  c) 
Adaptation  ON/Failures  ON.  As  seen  from  the  figure,  the  adaptive  system  is  able  to  recover  its 
baseline/no  failure  performance,  while  accurately  tracking  the  desired  inner-loop  commands. 


Figure  4-6.  Inner-Loop  Adaptation  with  RIB/ROB  Elevon  Failures  @  Trim 


The  adaptive  system  performance  was  evaluated  using  an  extensive  matrix  of  test  cases. 
Linear  data  for  the  reference  model  (4.10)  were  scheduled  throughout  the  aircraft  flight 
envelope.  Required  by  the  adaptive  laws  (4.34),  the  unique  symmetric  positive  definite  solution 
P  of  the  corresponding  algebraic  Lyapunov  equation  (4.25)  was  approximated  on-line  using  the 
iterative  procedure  outlined  in  Appendix  C. 


42 


GP49152 


Robust  Adaptive 
Control  of  UCAVs 


Final  Report 

BOEING  -  STL  2004P0037 


4.6.2  Internal  Model  Principle-Based  Controller 

An  IMPNOR  control  scheme  was  tested  for  various  inputs  and  failure  conditions.  Figure 
4-7a  shows  the  simulation  results  of  the  inner-loop  system  with  the  IMPNOR  controller  when  the 
right  outboard  surface  (ROB)  failure  is  present.  As  can  be  seen,  the  IMPNOR  controller 
sufficiently  recovers  the  nominal  response  of  the  aircraft,  by  reducing  the  oscillation.  In  Figure 
4-7a,  4-7b,  and  4-7c,  the  legend  IMP  is  used  for  IMPNOR. 
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Figure  4-7a.  Aircraft  Response  without  Pitch  Break  Phenomenon  with  IMPNOR  Control 
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For  the  body  axis  angular  accelerations,  P,Q,R  the  IMPNOR  controller  adequately  reduces 
the  oscillations  that  they  experience  with  a  ROB  failure,  as  can  be  seen  in  Figure  4-7b  below. 
Although  the  angular  accelerations  P  and  Q  still  exceeded  that  of  nominal,  the  aircraft  motion 
is  under  greater  control  than  that  without  the  IMPNOR  controller. 


Figure  4-7b.  Aircraft  Acceleration  Response  without  Pitch  Break  Phenomenon 
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Figure  4-7c  shows  the  deflections  of  the  individual  surfaces  and  that  the  aircraft’s  surfaces 
are  within  actuation  position  limits  and  do  not  suffer  from  the  same  oscillations  that  occur 
without  the  IMPNOR  controller. 


Time  (sec) 


Figure  4-7c.  Surface  Deflections  in  Flight  Conditions  without  Pitch  Break 
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Next,  the  same  conditions  and  inputs  were  applied  to  the  same  aircraft  model,  only  this  time 
with  the  Pitch  Break  phenomenon  active.  From  Figures  4-8a,  4-8b,  and  4-8c,  we  see  that  the 
IMPNOR  controller  is  able  to  recover  the  nominal  aircraft  response  when  the  right  outboard 
surface  failure  is  present.  In  fact,  the  controller  is  able  to  dampen  out  the  additional  errors 
induced  by  the  Pitch  Break  phenomenon  in  Az  and  |3  better  than  the  baseline  controller,  giving 
much  better  damped  responses  to  all  inputs. 
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Figure  4-8a.  Aircraft  Response  with  Pitch  Break  Phenomenon 


46 


GP49152 


Rdot  (d^/sec^)  ^  ^  Pdot^deg/sec^)  Pdot  (deg/sec^) 


Robust  Adaptive  Final  Report 

Control  of  UCAVs _ boeing  -  stl  2004P0037 


4.7  UCAV  X-45A  Guidance  (Outer-Loop)  System  Augmentation 

In  this  section,  the  X-45A  guidance  system  is  introduced  into  the  adaptive  design  process. 
Two  guidance  models,  =  3  -  dimensional  longitudinal  and  =  2  -  dimensional  lateral- 

directional,  are  introduced: 


I  ^long  ^long  ^long  ^tong  ^long 


y long  ^hng  ^long  ^long  ^long  ^ 


I’cmd 


I  ^latd  ~  Aatd  ^latd  ^latd  ^lafd 
\yiafd  “  ^latd  ^latd  ^latd  ^latd  “  ^ 


vmd 


(4.37) 

(4.38) 


The  three  dimensional  input  vector  to  the  longitudinal  guidance  model  (4.37)  is  defined 
as: 


where  h  and  h  denote  commanded  altitude  (ft),  actual  altitude  (ft),  and  aircraft  vertical 
speed  (ft/see).  Commanded  altitude  acceleration  (ft/sec/sec)  is  computed  as  the  system 
(4.37)  output  signal,  (that  is  =  A™"' ). 

The  two  dimensional  input  vector  to  the  lateral-directional  guidance  model  (4.38)  is 
defined  as: 

wf  (4-40) 


where  y/™'^  and  y/  denote  eommanded  and  actual  heading  (yaw)  angles  in  degrees. 

Commanded  roll  angle  (deg)  is  computed  as  the  system  (4.38)  output  signal,  (that  is 
Based  on  the  latter,  the  commanded  roll  rate  (deg/sec)  formed: 

-(l>)k^  (4.41) 

In  (4.41),  positive  constant  represents  a  commanded  roll  gain  (1/deg).  In  addition,  the 
commanded  yaw  rate  (rad  /  sec)  signal  is  specified  as: 

fcmd  ^  (4  42) 


where  V  denotes  the  aircraft  true  airspeed. 

Using  the  two  guidance  outputs  and  ,  vertical  acceleration  command  is 
computed: 


cmd 


(4.43) 
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Combining  (4.43),  (4.41)  and  (4.42),  the  vector  of  inner-loop  commands  is  formed: 

(4.44) 

In  order  to  properly  account  for  the  4  extra  degrees  of  freedom  (/z,  h,  used  in  the 

guidance  model,  their  approximated  dynamics  is  written  as: 

h  =  -a^  cos^ 

(4.45) 

t/  =  rcos(Z>-t-^sin^ 

Let  ]  and  b'f  ]  .  Augmentation  of  the  (Plant  + 

Inner-Loop  Controller)  -  system  (4.8)  with:  a)  extra  4  degrees  of  freedom  (4.45),  b)  longitudinal 
guidance  (4.37),  and  c)  lateral-directional  guidance  (4.38),  results  in  the  augmented  dynamics: 

x  =  Ax  +  B^  A^S+Kf^i^Xp^^  +  B^u 

f 

h  =  -  (C,  0)j:+Z),A(^+/:.(jr,)) 

C  ’ 

<P  =  P 

i/r  =  r  cosi^  +  q  sintp 

+ b'r  + yr  h + yr  h 


('P‘^-<P)K 


gsin^ 

V 
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Assuming  “small  angles”  and  rewriting  (4.46)  in  a  matrix  form,  we  get: 


/ 

JC  ^ 

f  A 

0 

0 

0 

0 

0 

0  ^ 

X  ^ 

h 

0 

0 

1 

0 

0 

0 

0 

h 

h 

-C 

0 

0 

0 

0 

0 

0 

h 

z= 

^3 

0 

0 

0 

0 

0 

0 

¥ 

^5 

0 

0 

0 

0 

0 

0 

¥ 

^long 

0 

0 

0 

Aong 

0 

^long 

u 

0 

0 

0 

0 

Aard 

K^lald  J 

X 

i 

X 

{ 

\ 

f  0 

0 

0 

0 

0 

0 

0 

0 

-D 

p 

0 

0 

0 

0 

0 

a(<^ 

+  ^o(^P 

))+ 

0 

U-h 

0 

0 

0 

j^cmd 

0 

\ 

0 

0 

0 

0 

y’”\ 

0 

0 

blcg 

0 

0 

u 

I  0 

J 

.0; 

j 

[  0 

0 

rlatd 

B,  B, 


or,  equivalently: 

X  =  a{^S + Kq  j  m  +  ^3  w  (4.47) 


In  the  formulation  of  the  augmented  system  matrix  in  (4.47),  two  unit  vectors  [e^,  e^) 
were  introduced: 

e3  =(0  0  1  0  0  0  0  0  of 

< 

e5  =(0  0  0  0  1  0  0  0  of 

Additionally,  matrix  element  A2  ^2  was  set  to  a  nonzero  value  that  represented  the  aircraft 
bank  angle  {(p)  influence  on  the  sideslip  (y5)  dynamics. 

Let  =  \d‘r^  ] ,  and  =  [<"'  d‘f  ] .  Then  using  again  the  “small 

angles”  assumption,  the  vector  u  of  inner-loop  commands  (4.44)  becomes: 


(4.48) 


(4.49) 
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In  (4.49),  guidance  output  signals  h™‘'  and  are  defined  by  (4.37)  and  (4.38), 
correspondingly. 

X, /!'■■'+<■'  h+d'r  h 
[(»"'=  Q-  ^,-+4“  V’-'+df  W 

Using  relations  (4.50)  yields: 


u  — 


-Q..  -  d'r  h  -  d'r  i, 

j^cmd 

V 


or,  equivalently,  in  a  matrix  form: 


^0  - 

0 

0  -C, 

long 

0  ^ 

h 

0 

0 

0 

0 

0  0 

0 

h 

= 

0 

0 

0 

^ laid 

<t> 

0 

0 

0 

g_ 

0  0 

0 

¥ 

V 

J 

^long 

V 

1 

K.^latd  > 

1 

0 

0  ^ 

0 

1 

0 

j^cmd 

=  F^x  +  F^u 

0 

0 

.,-cmd 

V 

0 

0 

0  J 

(4.50) 


(4.51) 


Finally,  substituting  (4.51)  into  (4.47)  yields  the  complete  system  dynamics: 

j  =  [A,+B,Fl)x  +  B,K[d+K,[x^)]  +  [B,  +  B,F^)u 

\ - ^ ^ - y - ^ 

A 

=  Ax  +  B^  A^(5'+i^o  (xp)j  +  jB4  u 


(4.52) 


Note  that  in  (4.52)  a  three  dimensional  vector  of  the  outer-loop  commands 
u  —  was  introduced.  From  (4.51)  it  follows  that  the  inner-loop  commands 

u  are  determined  through  the  outer-loop  commands  u  and  the  system  state  x  using  the 
feedforward  gain  matrix  F^  and  the  feedback  gain  matrix  FJ ,  correspondingly. 
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where  the  symmetric  positive  definite  matrix  P  satisfies  algebraic  Riccati  equation: 

AlfP+P  Aef -PKfP''KfP+Q  =  o 


(4.57) 


Using  (4.55)  and  (4.56),  the  corresponding  closed-loop  reference  model  dynamics  become 
asymptotically  stable  and  can  be  written  as: 


^ref  ~  (Are/  ^ref  Pj  )  ^re/  Pv  ~  A^ef  ^ref 


'ref  ^ 


(4.58) 


BZf 


At  the  same  time,  feedforward  gain  can  be  chosen  to  achieve  a  desired  DC  gain  matrix 
G^^j-  of  the  reference  model  (4.58). 

(4.59) 

The  unique  optimal  solution  of  (4.59)  is: 

fJ  (4.60) 

' - - ' 

where  S^^j-  is  the  pseudo-inverse  matrix  for  the  rectangular  matrix 

Note,  that  the  newly  derived  asymptotically  stable  reference  model  (4.58)  is  similar  to  the 
original  reference  model  (4.54).  Consequently,  without  a  loss  of  generality  and  throughout  the 
paper  it  is  assumed  that  matrix  is  Hurwitz  to  begin  with,  and  that  the  reference  model  is  of 

the  form  of  (4.54).  With  that  in  mind,  the  outer-inner-loop  control  objective  can  now  be 
formulated. 

The  outer-inner-looD  control  objective  is  to  design  state  feedback  inner-loop  control  S  for 
the  complete  system  (4.52)  such  that  despite  the  system  uncertainties  A  and  (x^ ) ,  all  closed- 

loop  signals  in  (4.52)  are  bounded,  and  the  system  state  x  tracks  the  state  x^^^  of  the  reference 
model  (4.54). 

Since  the  system  (4.52)  and  the  reference  model  dynamics  (4.54)  are  similar  to  the  system 
(4.8)  and  its  corresponding  reference  dynamics  (4.10),  the  adaptive  control  design  process  is  also 
similar  to  the  design  that  was  introduced  in  Sections  4.3, 4.4,  and  4.5.  Repeating  the  design  steps 
that  lead  to  the  inner-loop  adaptation  (4.34),  the  new  adaptive  laws  can  be  written  that  explicitly 
account  for  the  guidance  loops: 

4=ri  Proj(4.-Je''?S|),  4(0)  =  0... 

4(0)  =  0„,  (4.61) 

4  =  re  Proj(0,<I>(x,)e"Ffi,),  0(O)  =  O„,.„ 
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In  (4.61),  e  =  represents  the  complete  system  tracking  error,  P  is  the  unique 

symmetric  positive  definite  solution  of  the  algebraic  Lyapunov  equation: 

ALfP  +  PAef=-Q  (4-62) 

and  0  is  a  symmetric  positive  definite  matrix.  In  addition,  symmetric  positive  definite  matrices 
Ty ,  ,  Fq  must  be  chosen  to  specify  desired  rates  of  parameter  adaptation. 

As  it  was  done  in  (4.33),  the  adaptive  incremental  control  is  embedded  into  the  nominal 
inner-loop  linear  feedback.  The  total  inner-loop  control  takes  the  form: 

S  =  SL{xp,Xc,u)  +  kIx+kIu-Q^^[xp)  (4.63) 

V - ^ - .  > - ^ . 

nominal  adaptive 

It  is  designed  to  a)  change  /  reconfigure  the  nominal  feedback  gains  ,  Ky,h)  feed  back  / 
feed  forward  the  guidance  states  /  commands,  and  c)  identify  /  counteract  the  nonlinear  matched 
uncertainty  Kq  )  in  the  original  system  (4.1).  As  a  result,  bounded  tracking  of  the  reference 

model  (4.54)  trajectories  is  achieved  in  spite  of  unknown  control  failures  and  unmodelled 
nonlinear  dynamical  effects.  The  corresponding  closed-inner-outer-loop  system  block-diagram 
is  shown  below. 


Figure  4-9.  Closed-Inner-Outer-Loop  System  Block  Diagram 


Note  that  if  the  reference  model  is  defined  by  (4.58)  instead  of  (4.54)  then  P  should  be  taken 
as  the  unique  solution  of  the  algebraic  Riccati  equation  (4.57).  Also,  the  total  inner-loop  control 
input  takes  the  form: 

S  =  S,(x^,x^,u)  +  k^x+k^,(Fjx  +  Flv)-e^^x^)  (4.64) 

' - V'l - ^  ' - V - 

nominal  guidance  augmentation 

V - - 7 

adaptive 

with  the  inner-loop  commands  vector  u  defined  as  in  (4.51). 
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u  =  F^^  x  +  F[  u=F^^  x  +  F^  (fI  x  +  FJv) 
=  +  F[  Fl)x  +  ^^V  =  Ff  x  +  F^‘  V 


The  corresponding  closed-loop  system  block-diagram  is  shown  in  Figure  4-10. 


Figure  4-10.  Closed-Inner-Outer-Loop  System  Block  Diagram 
with  Guidance  Augmentation 


(4.65) 
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Figure  4-11a.  Inner-Outer-Loop  Adaptation  during  Max  Rate  Climb/Level  Off 
with  ROB  Eleven  Failure  and  Pitch  Break  Phenomena 


Figure  4-1 1b.  Inner-Outer-Loop  Adaptation  during  Max  Rate  Climb/Level  Off 
with  ROB  Eleven  Failure  and  Pitch  Break  Phenomena 
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Time,  sec 


Figure  4-1 1c.  Inner-Outer-Loop  Adaptation  during  Max  Rate  Climb/Level  Off 
with  ROB  Eleven  Failure  and  Pitch  Break  Phenomena:  Control  Deflections 


4.9.2  Internal  Model  Principle-Based  Controller 

To  evaluate  the  performance  of  the  IMPNOR  controller  with  the  Outer  Loop  command  for  a 
climb  and  level-off  maneuver,  simulations  were  performed  and  the  results  are  shown  in  Figure 
4-12a,  4-12b,  and  4-12c.  Note  that  the  Pitch  Break  phenomenon  is  active.  As  can  be  seen  from 
Figure  4- 12a,  4- 12b,  and  4- 12c,  unwanted  oscillations  caused  by  either  the  surface  failure  and/or 
the  Pitch  Break  phenomenon  are  effectively  cancelled  while  keeping  the  surfaces  within  their 
actuator  position  limits. 
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Figure  4-1 2a.  Aircraft  Response  to  Climb/Level-off  with 
Pitch  Break  Phenomenon  and  ROB  Failure 


0  5  10  15  20  25  30  35 

Time,  sec 

Figure  4-12b.  Aircraft  Angular  Acceleration  Response  to  Climb/Level-off  with 
Pitch  Break  Phenomenon  and  ROB  Failure 
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Figure  4-12c.  Aircraft  Guidance  Response  to  Climb/Level-off  with 
Pitch  Break  Phenomenon  and  ROB  Failure 


4.10  Conclusions 

Design,  stability  analysis  and  simulation  evaluation  of  a  direct  adaptive  model  following 
control  system  was  presented.  The  adaptive  laws  were  theoretically  justified  and  derived  based 
on  the  fundamentals  of  the  Lyapunov  Stability  Theory.  The  system  was  designed  to  augment  an 
existing  nominal  robust  baseline  controller  with  an  adaptive  online  trained  feedforward  neural 
network.  The  adaptive  element  provided  for  reconfiguration  in  the  presence  of  system 
uncertainties  (e.g.,  battle  damage),  control  failures,  and  environmental  disturbances.  Using 
UCAV  X-45A  model  and  data,  a  step-by-step  adaptive  control  design  process  was  presented. 
Radial  basis  functions  (RBF)  and  single-hidden-layer  (SHL)  feedforward  neural  network  (NN) 
architectures  in  conjunction  with  the  projection  operator  were  employed  to  derive  the  adaptive 
laws.  The  adaptive  element  provided  bounded  tracking  and  a  uniform  ultimate  boundedness  of 
all  the  signals  in  the  corresponding  closed-loop  system.  Both  the  inner-loop  design  and  the 
outer-inner-loop  (explicitly  accounting  for  guidance)  adaptations  were  performed.  Benefits  of 
the  adaptive  system  and  its  performance  were  evaluated  in  the  UCAV  X-45A  flight  simulation 
environment. 
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5.0  ADAPTIVE  CONTROL  OF  ELECTROMAGNETIC  ACTUATORS 


Actuator  nonlinearities  are  among  the  key  factors  limiting  both  static  and  dynamic 
performance  of  feedback  control  systems.  Harmful  effects  of  backlash  in  gears  are  well  known. 
In  fact,  such  nonlinearities  were  observed  and  reported  in  a  technical  memo  [27].  In  this  memo, 
it  was  reported  that  the  quantizer  and  backlash  effects  dominated  the  nonlinear  response,  and  that 
the  response  became  very  nonlinear  as  the  command  magnitude  (fm  deflection  angle  of  a 
missile)  decreased. 

Backlash  prevents  accurate  positioning  and  may  lead  to  chattering  and  Iimit-cycle-t5q3e 
instabilities.  This  increases  wear  and  tear  on  the  gears,  which  in  turn,  increases  backlash. 

Various  anti-backlash  gears  have  been  proposed  as  a  solution  to  this  problem.  However,  their 
cost  is  high,  and  they  introduce  extra  weight  and  friction.  Attempts  to  eliminate  or  reduce 
backlash  by  increasing  the  gain  of  the  feedback  loop  lead  to  sustained  oscillations  which  may 
cause  rapid  wear  of  gear  trains,  valves,  and  other  components.  Thus,  intelligent  but  inexpensive 
approaches  to  increase  accuracy  against  the  nonlinearities  in  the  actuator  dynamics  are  needed  in 
order  to  improve  aircraft  and  missile  performance  without  additional  cost. 

G.  Tao  and  P.  Kokotovic  have  done  extensive  investigation  on  adaptive  control  of  systems 
with  actuator  and  sensor  nonlinearities  [28].  We  will  follow  their  methodology  (we  will  call  it 
“Adaptive  Inverse  Method”  in  this  report)  to  implement  an  effective  nonlinear  control  scheme 
for  actuators.  We  will  also  apply  the  Internal  Model  Principle  technique  to  compare  the  results 
with  those  by  the  Adaptive  Inverse  method. 

5.1  Inverse  Models 

The  method  proposed  by  Tao  and  Kokotovic  uses  an  inverse  model  to  cancel  the  nonlinear 
effects  induced  by  dead-zone  and  backlash.  That  is,  if  we  let  u  =  N(v)  represent  the  nonlinear 
element  in  the  actuator,  where  v  denotes  the  input  to  the  actuator  and  u  the  output  from  the 
actuator,  then  the  compensator  for  the  input  nonlinearity  N(.)  is  v  =  Nl(ud).  Here,  NI(.)  is  the 
desired  inverse  and  u<i  is  the  control  input  which  would  achieve  the  control  objective  in  the 
absence  of  N(.).  Thus,  when  the  inverse  is  exact,  u  =  N(NI(u<i))  =  Ud  achieves  the  control 
objectives  as  if  N(.)  were  absent.  However,  in  reality,  NI(.)  is  seldom  exactly  known  and  only  an 
estimate  NIcst(.)  of  the  inverse  NI(.)  can  be  implemented. 

5.1.1  Dead-Zone  Inverse 

Let  br  (>  0)  and  bi  (<  0)  denote  break-points,  and  mr  and  mi,  represent  the  slopes  of  the  linear 
portion  of  the  dead-zone  characteristics  as  shown  in  Figure  5-1.  For  a  linear  parametrization  of  a 

dead-zone  inverse,  we  use  the  estimates  ,  niib, ,  m,  of  m^b^ ,  ,  m,bi ,  m, , 

respectively.  The  estimates  of  b^ ,  b,  are  then  obtained  from 


b..  = 


_  m,.b,. 


m,. 


m 


(5.1) 
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Then  the  inverse  model  for  the  dead-zone  characteristic  is  described  [24]  by 

V(t)  =  DIcst(Ud(t))  = 


nti 


UdQ)  +  mA 


if  Ud(t)  >  0 
if  Ud(t)  =  0 
if  Ud(t)  <  0 


(5.2) 


5.1.2  Backlash  Inverse 

One  of  the  damaging  effects  of  backlash  on  system  performance  is  the  delay  corresponding 
to  the  time  needed  to  traverse  an  inner  segment.  Another  undesirable  effect  of  backlash  is  the 
information  loss  occurring  on  an  inner  segment  when  the  output  u(t)  remains  constant  while  the 
input  v(t)  continues  to  change  as  shown  in  Figure  5-2. 


62 


GP49152 


Robust  Adaptive 
Control  of  UCAVs 


Final  Report 

BOEING  -  STL  2004P0037 


These  undesirable  effects  are  eliminated  with  the  backlash  inverse  v(t)  =  BI(ud(t))  where  v(t) 
is  determined  by 

•  1  •  •  IJ  (t^ 

v(t)  =  —  Ud(t)  if  Ud(t)>0,  v(/)  =  ~  -  +  c,.,  or 
m  m 

if  w4/)<0,v(/)  =  ^^  +  c, 
m 


=  0  if  u{t)  =  0 

-  g(t,t)  if  u{t)  >  0,  v(0  =  +  c, 

m 

=  -g(t,t)  if  u{t)  <  0,  v(/)  =  +  c,,.  (5.3) 

m 

In  this  definition,  the  inverse  of  a  horizontal  segment  of  the  backlash  characteristic  is  a 
vertical  jump  defined  as  the  time  integral  of  the  impulse  function 

g(T,t)  =  6(x-t)(crcO  (5.4) 

where  8(t)  is  the  Dirac  5-function.  Thus  an  upward  jump  in  the  backlash  inverse  is 

v(t^)  =  v(f)+  fg(T,  t)dT  =  -I-  c,.  (5.5) 

•  m 

I 

The  effect  of  this  jump  in  the  backlash  inverse  model  will  be  to  eliminate  the  delay  caused  by 
a  segment  in  the  backlash  characteristic.  In  a  similar  manner,  the  use  of  the  backlash  inverse 
model  given  by  (5. 3-5. 5)  restores  the  information  that  would  have  been  lost  without  it.  When  the 

A 

backlash  parameters  m,  ci,  Cr  are  unknown,  we  will  replace  m,  ci,  and  Cr  with  their  estimates  m , 

A  A 

c, ,  in  equations  (5. 3-5. 5)  to  design  a  backlash  inverse  estimate. 


5.2  Adaptive  Inverse  Control 

The  objective  is  to  control  a  system  with  a  nonlinearity  N(.)  at  the  input  of  a  linear  time- 
invariant  part  G(s),  which  is  described  by 


where  v(t) 


y(t)  =  G(t)  [u](t),  u(t)  =  N(v(t)), 

is  the  control  input  and  y(t)  is  the  measured  output.  The  linear  part  is  G(s)  =  k 


(5.6) 

Z{s) 

P(s)’ 


where  k  is  a  constant  gain  and  Z(s),  P(s)  are  monic  polynomials  in  the  Laplace  transform 
variable  s.  Our  goal  is  to  develop  a  method  to  design  a  feedback  control  v(t)  using  the  measured 
output  y(t)  to  achieve  global  stabilization  and  close  tracking  of  a  reference  signal  ym(t)  by  y(t). 
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The  control  strategy  proposed  by  Tao  and  Kokotovic  uses  a  linear  model  reference  controller 
structure  plus  an  adaptive  nonlinear  inverse  model  to  be  placed  at  the  input  of  the  nonlinear 
element.  To  employ  a  model  reference  approach,  they  characterize  the  reference  signal  yni(t)  as 

ym(t)  =  W,[r](t)  (5.7) 

where  Wm  is  stable  and  the  command  input  r(t)  is  bounded  and  piecewise  differentiable  and 
impose  the  following  assumption: 

Al)  Z(s)  is  a  stable  polynomial, 

A2)  The  degree  n  of  P(s)  is  known, 

A3)  The  relative  degree  n*  =  n  -  m  of  G(s)  is  known, 

A4)  The  sign  of  the  gain  k  is  known, 

A5)  Wni(s)  =  Pm''(s)  for  a  stable  polynomial  Pm(s)  of  degree  equal  to  the  relative  degree  n*. 

With  these  assumptions  together  with  the  assumption  that  excludes  the  inappropriate  range 
for  parameter,  they  established  the  controller  structure  shown  in  Figure  5-3. 

•  •  • 

If  y{t)  and  r(t)  are  both  available  and,  r(t)  is  bounded,  then  the  following  controller 
structure  is  suitable: 

+ e*^(oAt) + 620  yit)  +  (5-8) 

where  6*  and  6^  are  vector  parameters;  and  ,  6^  are  scalar  parameters. 
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In  the  filters 


A(5) 

A(5) 


(5.9) 


the  vector  a(s)  is 

a(s)  =  (l,s,....,s"-Y 

and  A(5)  is  a  stable  polynomial  of  degree  n-1. 

The  parameters 

0*  e  Y  6  ^"■’.^20*  e  e  R 
are  chosen  to  satisfy  the  matching  equation 

+  i0,*"a(s)  +  6,;  K{s))kZ{s)  =  Kis){P{s)  -  e;kZ{s)PJs))  (5.10) 

With  this  controller  structure  a  dead-zone  inverse  is  directly  implementable,  while  the 
backlash  inverse  requires  Ud(t) ,  which  can  be  obtained  by 

Ui)=e:  ^ly](i)+e^J  ko + «/  K<).  (5- 1 1 ) 

A(5)  A(s) 


An  alternative  controller  structure  is  also  developed  for  the  case  when  y(t)  is  not  available, 
•  •  • 
but  r(t)  is  measured  and  bounded  (see  [24]).  If  y{t)  or  r{t)  is  not  available,  then  we  can 

employ  a  simple  approximation  of  M</(t)  such  as 


•  • 

where  x  >  0  is  a  small  constant  and  j(t)  and  r(/)  as 

^(0«-^[T](0 

C  +  1 
® +  1 

Based  on  the  tracking  error  equation  derived  by  Tao  and  Kokotovic, 


where  d{l)  =  iV(s)[di^](l),  the  following  update  law  can  be  adopted: 

m  N{t) 


(5.12) 


(5.13) 


(5.14) 

(5.15) 
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where fy(t)  is  a  modification  term, 

(0  ~  ^(0  (0 

f„(t)  = 

orS^  ={mc,.,m,mc,Y 

and  «Af  is  the  dimension  of  6^^ . 


To  Evaluate  the  performance  of  the  Adaptive  Inverse  Control  scheme,  simulation  was 
performed  using  the  Miniaturized  Munition  Technology  actuator  model  shown  in  Figure  5-4. 
The  model  contains  nonlinearities  such  as  deadzone.  Coulomb  friction  in  the  motor.  Coulomb 
friction  in  the  load,  and  Quantization  errors.  The  simulation  results  showed  a  potential  that  the 
Adaptive  Inverse  Control  technique  can  provide  an  improved  performance  in  controlling  the 
actuator  dynamics  compared  to  a  controller  with  a  detuned  fixed  inverse  or  no  inverse. 
However,  the  simulation  results  also  showed  the  following  issues  that  need  to  be  addressed  for 
practical  implementation  of  the  method: 
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1 .  Control  chatter  -  “softening”  the  inverse  could  reduce  the  chatter  at  the  cost  of  increased 
inversion  errors. 

2.  Adaptation  gain  -lack  of  this  logic  made  the  adaptation  process  extremely  slow. 

3.  Robust  adaptation  -  need  of  robust  adaptation  logic  that  would  allow  larger  initial 
parameter  errors  at  the  expense  of  increased  tracking  error  bounds. 

We  consider  an  alternative  method  that  alleviates  these  potential  problems. 

5.3  Application  of  Internal  Model  Principle  Output  Regulation  for  Actuator  Control 

In  this  section  the  internal  model  principle  is  applied  to  the  actuator  control  problem  where 
there  exists  a  dead  zone  that  limits  the  control  authority  needed  to  achieve  some  desired  task. 
Recall  that  graphically,  the  dead  zone  can  be  represented  by  a  linear  piece- wise  continuous 
input-output  map. 

s^(v-aj  +  h  v>a, 

DZ(v)  =  <  h  -a,  <v<o, 

5i(v-l-a,)-l-/?  v<-a, 

where  5,  and  $2  are  the  slopes  of  the  input-output  graph  to  the  left  and  to  the  right  of  the  dead 
zone  area,  respectively,  -  a ,  and  a  2  are  the  dead  zone  left  and  right  boundaries,  respectively, 
and  h  is  the  vertical  offset.  The  dead  zone  problem  has  been  addressed  by  Tao  and  Kokotovic 
[28]  using  adaptive  control  methods  to  realize  a  dead  zone  inverse,  denoted  by  DZ~'  (u),  to 
cancel  the  effect  of  the  dead  zone.  When  the  parameters  of  the  dead  zone  are  known,  then  the 
dead  zone  inverse  can  be  constructed  easily.  On  the  other  hand,  when  the  dead  zone  parameters 
are  uncertain,  the  dead  zone  inverse  can  be  reconstructed  by  an  adaptive  method  or,  in  this  case, 
via  an  internal  model.  The  mathematical  representation  of  the  inverse  dead  zone  is 

u>  h 

u  =  h 

u<h 

It  can  be  shown  that  DZ(DZ''(u))=u  and  DZ''(DZ(v))=v.  The  adaptive  method  attempts  to 
estimate  the  unknown  parameters  of  the  inverse  dead  zone,  whereas  the  internal  model  method 
seeks  to  estimate  the  required  output  of  the  inverse  dead  zone. 

To  understand  the  method  first  consider  the  simple  system 

x  =  bx  +  DZ(v) 
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where  -1<  b<  1  and  the  dead  zone  parameters  are  uncertain  but  are  known  to  lie  on  some 
range.  Now  assume  that  the  goal  is  to  track  the  output  Xrcf  of  the  reference  system 

w^  =  W2 

Wj  =  — VV’i 


If  b  is  known  and  the  input  to  the  plant  is 

K  =  DZ  ■'  (k^e  +  -  bx  ) 


then  the  problem  is  solved  for  some  ko=ko(b,R(Dzp))  where  R(Dzp)  represents  the  range  of  the 
dead  zone  parameters.  The  rate  of  convergence  is  determined  by  the  choice  of  ko.  Since  the 
parameters  are  not  known  at  any  given  time,  this  input  cannot  be  constructed  directly.  Therefore, 
the  desired  input  at  any  given  time  is 


v  =  DZ''(m)  = 


u-h 


■  +  a^ 


0 

u-h 


-a, 


u  >  h 
u  =  h 
u  <  h 


where 


Given  the  ranges  of  the  parameters,  it  is  possible  to  choose  ko  for  robust  stability,  however, 
the  gains  on  the  reference  based-terms  must  be  specified  exactly.  Since  Xrcf  is  generated  by  the 
dynamic  reference  model,  an  immersion  is  sought  to  bypass  this  requirement.  The  reference 
dynamics  with  output  u 


w,  =  Wj 

W2  =  “^1  is  immersed  into 

U  =  +<312^2  +<^3 


u  =  r(h 


The  theory  of  system  immersion  is  outlined  in  [28]  and  will  not  be  repeated  here.  The 
important  point  is  system  immersion  allows  for  the  realization  of  a  controller  whose  output  is  a 
copy  of  the  steady-state  input  needed  to  regulate  the  stabilized  system  despite  bounded 
uncertainty  in  the  plant  parameters. 

The  controller  that  is  used  is 

4  =^2+«ie 
4  =^3  +«2^ 

^3  ~  ^3^ 

^0  ~  ^^0 
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where  the  parameters  {ni,n2,n3,K,L,M,ri}  are  chosen  to  robustly  stabilize  the  interconnection, 
yielding  satisfactory  tracking  for  small  and  large  inputs.  The  small  inputs  are  more  affected  by 
the  dead  zone  than  the  larger  inputs  as  will  be  shown. 


x  —  bx  +  DZ(v) 

4  =-^2+«3^ 

v  =  r,^,  +M^^+koe 


W|  =  Wj 

W2  = 
e  =  x  -  w, 


r52(v-aj)  +  /?  v><3j 


DZ(v)  =  { 


h 


-a,  <  V <  Oj 


[5, (v +  «,)  +  /?  v<-a, 


where  - 1  <  6  <  1 ,  the  dead  zone  parameters  satisfy  the  constraints  5, ,  53  e  [1,5] ,  h  e  [-1,1] , 
a,,fl2  e  [0.1,1] .  The  single  state  of  the  plant  was  rate  limited  (±100  units)  and  position-limited 
(±20  units). 


Figure  5-5.  Plots  Showing  Tracking  for  Small  and  Moderate  Sized  Input 


Figure  5-6.  Plots  Showing  Tracking  for  Small  and  Moderate  Sized  Inputs 
Whose  Characteristic  Shape  is  Time  Varying 
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The  internal-model  controller  formulation  is  now  applied  to  the  Matlab  MMT  Model  shown 
in  Figure  5-7.  The  highlighted  block  is  the  internal  model,  and  comparison  is  done  for  cases 
where  the  commanded  signal  is  fed  directly  into  the  actuator  and  where  the  commanded  signal  is 
fed  into  the  internal  and  the  output  of  the  internal  model  drives  the  actuator. 


@ — tn 


The  nominal  system  (no  internal  model)  suffers  from  limited  response  for  ‘small’  inputs  due 
to  the  piecewise  linear  dynamics  in  the  actuator.  It  will  be  shown  that  the  internal  model  greatly 
improves  the  response  in  these  cases  as  well  as  maintains  satisfactory  response  for  larger  inputs. 
The  problem  is  approached  in  the  same  way  that  the  previous  problem  was  formulated  and 
solved.  The  form  of  the  IMPNOR  scheme  is  the  same  with  different  regulating  gains.  However, 
in  this  case,  no  stabilizing  controller  was  necessary  due  to  the  strong  stability  properties  of  the 
actuator  dynamics. 


Figure  5-8.  Plots  Showing  Tracking  for  Small  Reference  Input  without  IMPNOR  Scheme  (left) 

and  with  IMPNOR  Scheme  (right) 
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Figure  5-9.  Plots  Showing  Tracking  for  Larger  Reference  input  without  IMPNOR  Scheme  (left) 

and  with  IMPNOR  Scheme  (right) 

The  spike-like  behavior  observed  for  the  IMPNOR  scheme  can  be  removed  by  filtering  the 
feedback  signal  x(t)  without  adversely  affecting  the  overall  performance.  The  most  important 
feature  is  the  improvement  gained  for  ‘small’  inputs.  In  fact,  if  the  amplitude  of  the  reference  is 
reduced  to  0. 1 5  units,  then  the  system  without  the  IMPNOR  scheme  produces  no  output  due  to 
the  dead  zone.  On  the  other  hand,  when  the  IMPNOR  scheme  is  in  the  loop,  some  tracking  is 
recovered.  See  Figure  5-10. 


Figure  5-10.  Plots  Showing  No  Tracking  for  Very  Small  Input  without  IMPNOR  Scheme  (left) 
and  Approx.  Tracking  with  IMPNOR  Scheme  (right) 


5.4  Summary 

The  IMPNOR  method  is  able  to  maintain  tracking  in  the  presence  of  dead  zone  and  Coulomb 
friction.  The  IMPNOR  scheme  realizes  a  controller  that  generates  a  copy  of  the  required  inverse 
dead  zone  dynamics  needed  to  cause  the  tracking  error  to  asymptotically  approach  zero.  The 
main  benefit  lies  in  the  ability  to  track  small  amplitude  signals  in  spite  of  the  dead  zone. 
Therefore,  the  overall  performance  of  the  baseline  system  is  improved  noticeably  when  the 
internal  model  is  added. 
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6.0  DEPARTURE  ANALYSIS  TOOL  FOR  UCAVS 

Extensive  investigations  of  robustness  analysis  for  linear  systems  were  done  during  the 
1980’s,  which  brought  together  many  useful  results  with  the  goal  of  developing  a  practical 
robustness  analysis  tool.  Two  early  significant  developments,  upon  which  many  robustness 
analysis  techniques  rely,  may  be  the  structured  singular  value  analysis  by  Doyle  et.  al.  [29],  and 
the  stability  margin  analysis  with  real  parameter  variations  by  DeGaston  and  Safonov  [30],  The 
structured  singular  value  analysis  provides  a  way  to  predict  the  exact  stability  margin  if  the 
system  has  dimension  less  than  four,  and  the  lower  bound  of  the  stability  margin  if  the  system 
has  dimension  greater  than  or  equal  to  four  in  the  presence  of  uncertain  complex  parameters. 

The  method  developed  by  DeGaston  and  Safonov  provides  a  way  to  predict  the  exact  stability 
margin  in  the  presence  of  uncertain  real  parameters.  At  the  industry  level.  Wise  implemented  the 
latter  method  successfully  into  a  numerical  tool  called  ROBUSTR  [31],  and  performed 
robustness  analysis  for  various  missiles  and  aircraft  applications. 

Analysis  of  nonlinear  systems,  on  the  other  hand,  has  stayed  mainly  at  the  theoretical  level. 

A  large  body  of  work  has  been  devoted  to  extend  the  linear  analysis  techniques  to  nonlinear 
systems.  However,  computational  methods  have  not  been  developed  to  a  level  comparable  to 
their  linear  counterparts.  In  predicting  aircraft  departure,  much  computation  is  required  to 
perform  nonlinear  robustness  analysis  when  linear  analysis  can  not  be  used  to  approximate  the 
highly  nonlinear  unsteady  aerodynamic  characteristics. 

Murray  et.  al.  presented  a  numerical  method  for  robustness  analysis  for  nonlinear  systems 
[10].  Murray’s  method  is  briefly  summarized  below  in  order  to  establish  a  basis  of  our  approach 
toward  the  development  of  a  numerically  efficient  tool  for  nonlinear  robustness  analysis. 

6.1  Problem  Formulation 

Consider  the  nonlinear  system  given  by 

x  =  F(x,u,v,S,U,t) 

x(/o)  =  Xo 

where  F  represents  a  smooth  nonlinear  function  of  a  state  jc,  a  set  of  signals  u  corresponding  to 
external  disturbances,  a  set  of  signals  v  corresponding  to  the  effect  of  components  which  are  not 
independent  of  the  other  signals,  but  exact  dependence  is  not  known,  a  set  of  parameters  S,  and  a 
set  of  nominal  commands  U  responsible  for  steering  the  nominal  system  along  the  nominal 
trajectory.  The  trajectory  is  described  by  a  set  of  coordinates  Y,  given  by  a  smooth  function  G  of 
the  same  variables 


Y  =  G(x,u,v,S,U,t) 

(6.2) 

The  nominal  trajectory  Y„  is  denoted  by 

Y„=G(x,0,0,S„,U,t) 

(6.3) 

where  S„  are  the  nominal  values  for  the  parameters. 
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Given  an  uncertain  nonlinear  system  given  by  the  equation  (6.1)  and  the  nominal  command 
signal  U,  with  initial  conditions  satisfying 

c,.  <x,(/o)<C,.  i  =  1.2,...,n, 

the  problem  is  to  determine  the  maximum  value  of  the  norm  of  the  error  signal 

||y||  =  ||G(x,M,v,^,f/,0-J;||  (6.4) 

subject  to  the  constraints 


di<5i<Di  i=l,2,...,r 

where  z  is  a  variable  representing  uncertainty  output  given  by  a  smooth  function  H 

z  =  H{x,  u,  V,  d,  U,  t) . 

6.2  Necessary  Conditions 

The  necessary  condition  for  a  local  maximum  of  the  problem  stated  in  the  previous  section  is 
given  by  the  following  theorem: 

Theorem;  For  a  dynamical  system  described  by  the  equations: 

x  =  f{x,u,t) 
x(0)  given 

given  a  performance  index  of  the  form 

If 

J=  ^L{x,u,t)dt 

h 

and  a  restriction  on  the  final  state 

G(x(tf))  =  c, 

if  the  signal  uo  achieves  an  extremum  of  J,  then  there  exists  a  vector  of  constants  ^  and  a  solution 
to  the  two-point  boundary  problem 


x  =  f(x,u,t) 


1  =  - 


^axj 


Voxy 


\OU  )  \OU  ) 
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with  boundary  conditions: 


x(0)  given 


^  dG  ^ 

dxitf)  j 


Furthermore,  if  these  conditions  are  met  we  will  have 

dJ 


dx(0) 


6.3  Implementation  Issue 

Although  the  technique  discussed  in  [10]  is  intended  to  demonstrate  a  numerically  efficient 
method  for  robustness  analysis  for  nonlinear  system,  the  technique  requires  to  solve  the  two- 
point  boundary  problem  as  discussed  above,  which  is  normally  time  consuming.  Boeing  visited 
Dr.  Murray  at  California  Institute  of  Technology  (Caltech)  to  find  out  whether  or  not  further 
research  that  advances  the  technique  has  been  done.  After  the  discussion,  we  learned  Dr.  Murray 
and  his  research  group  at  Caltech  continue  to  investigate  stability  and  robustness  analysis 
method,  however,  the  research  direction  has  been  changed,  and  any  significant  advancement  of 
the  technique  of  interest  has  not  been  made  yet.  Therefore,  we  decided  not  to  investigate  the 
technique  any  further  toward  the  implementation  of  the  departure  analysis  tool  at  this  time. 
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subject  to  a  system 


X  =  f{X,u,t),  to<t<tf 
X{to)  =  Xo 


(7.2) 


where  h  is  at  least  a  C'  function  of  X.  Then  by  solving  the  Euler-Lagrange  equation,  we  have 


0 


=  f  + f  ^ 

\du)  \du 


(7.3) 


with  boundary  conditions: 


X{h)  -  Xo 


\{tf) 


=  V 

\dX{tf)) 


(7.4) 


Now,  to  solve  our  problem  of  interest,  we  set 

L{X,u,t)  =  —u^  and  h{X{tf))  =  2||X(t/)|P 
where  t/  ^[O,  7]  denotes  the  time  instance  at  which  the  minimum  separation  occurs,  i.e.. 


(7.5) 


ITO/)ll  =  ^lW)lU 


(7.6) 


and  where  u(t)  g  R  denotes  a  roll  rate  command,  and  X{t)  e  represents  a  relative  aircraft 
position.  Then  Eq.  (7.3)  yields 


with  the  terminal  condition 


and 


Since 


A{t/)  = 


X{t,) 

iwt/)ir’ 


(7.8) 


X{t) 


MtfW 


(7.9) 


(7.10) 
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the  optimal  control  u° 


(df\^ 

x{tf)  ] 

\du) 

B 

lk(i/)IN 

(7.11) 


By  integrating  u°(t)  from  t  =  to  /  =  tj,  the  optimal  roll  angle  is  determined  as 


///  u°{t)  dt 

r  \\x{tfW\ 

This  roll  angle  can  be  shared  between  two  aircraft  such  that 


f  ^  ui{t)  +  (  ^  Ui{t)  = 

JtQ  J  to 


(7.12) 


(7.13) 


where  Ui{i)  i=  1,  2  is  the  roll  rate  command  for  aircraft  1  and  2,  respectively. 

To  calculate  the  optimal  roll  angle  (j)®,  the  knowledge  about  //is  required  a  priori,  which  is 
not  known.  Therefore,  we  need  to  find  //-as  the  fixed  point  solution  of  the  following  set  of 
equations: 

X  =  f{X,v?,t),  X{to)  =  Xo 


u 


lk(i/)ir 


To  specify  the  smooth  function^A',  u,  /),  we  need  an  aircraft  model. 

7.2  Simple  Generic  Aircraft  Model 

In  the  earth-fixed  coordinate  system,  a  generic  aircraft  dynamics  can  be  derived  as: 

Vt  =  — {Th  cos  a- D)  -  g  sin ‘j 
m 

■  N^gsinip 
tp  =  -j} - 

VtCOS7 

7  =  ^{Nz  cos  ip  -  COS7) 

Vt  \ 

(p  =  p-f'0sin(7  +  a)  j 


(7.14) 


(7.15) 


Here,  g  and  m  denote  the  airspeed,  heading  angle,  flight  path 

angle,  bank  angle,  angle  of  attack,  thrust,  drag,  roll  rate,  normal  acceleration,  gravitational 
acceleration,  and  aircraft  mass,  respectively.  The  normal  acceleration  Nz  and  the  incremental  roll 
angle  A  ^  are  commanded  to  generate  the  escape  flight  path.  In  addition,  the  responses  of  the 
roll  rate  p  and  normal  acceleration  Nz  to  their  command  inputs  are  approximated  by  the  low  order 
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equivalent  systems,  which  are  described  by  first  and  second  order  responses,  respectively. 

That  is: 

P  —  {P  ~  Pcmd) 

^  (7.16) 

and 

N,  +  2CcoN,  +  -  N,,,md)  =  0 

where  t,  Z,  and  0)  respectively  represent  a  roll  mode  time  constant,  damping  ratio,  and  short 
period  frequency.  Nz,cmd  and  pcmd  are  maximum  allowable  normal  acceleration  command  and  roll 
rate  command,  respectively.  The  maximum  allowable  roll  rate  command  is  integrated  over  a 
certain  period  of  time  to  meet  the  commanded  roll  angle.  In  the  projeeted  escape  flight  path 
calculation,  the  throttle  setting  is  assumed  to  be  constant.  In  other  words,  the  thrust  Th  is 
assumed  to  be  constant  throughout  the  projected  escape  flight  path.  In  addition,  the  angle  of 
attack  a  is  calculated  by  solving 


Nz  = 


L  +  r^sina 
mg 


(7.18) 


where  L  denotes  the  lift.  The  lift  L  and  drag  D  are  calculated  by  looking  up  a  table  for  each 
flight  condition  using  a  generic  aircraft  model. 

Then  the  velocity  component  of  the  aircraft  in  the  earth-fixed  coordinated  system  can  be 
calculated  by 


X  =  VtCOSJCOStp 

y  =  Vtcos7sin'0 

z  =  Vt  sin  7 


(7.19) 


By  integrating  equation  (7.19)  with  respect  to  time,  the  projected  aircraft  position  (x(/),X0> 
z(/))  at  time  t  can  be  calculated. 


7.3  Summary 

An  analytic  solution  to  the  Automatic  Air  Collision  Avoidance  problem  was  discussed.  The 
current  method  requires  a  priori  knowledge  of  the  time  at  minimum  separation.  This  necessitates 
the  fix-point  iteration  before  advancing  the  time.  Therefore  it  is  not  viable  for  the  real-time 
operation.  However,  the  objective  of  the  current  research  is  not  to  develop  a  real-time  method 
(for  now  at  least)  for  the  air  collision  avoidance  problem,  but  rather  to  search  for  the  analytic 
optimal  solution.  This  would  provide  an  insight  into  what  needs  to  be  improved  for  some 
real-time  methods  based  on  numerical  approach,  particularly  when  the  optimality  of  the  solution 
is  questionable.  Of  particular  interest  is  to  evaluate  how  the  optimal  escape  maneuver  calculated 
mathematically  compares  with  the  escape  maneuver  a  pilot  considers  the  best,  and  to  analyze 
what  needs  to  be  done  in  order  to  fill  the  gap  between  them. 


78 


GP49152 


Robust  Adaptive  Final  Report 

Control  of  UCAVs _ boeing  -  stl  2004poo37 


8.0  REFERENCES 

[1]  A.  Isidori,  “Nonlinear  Control  Systems,”  Springer-Verlag,  New  York,  NY,  3'^^  edition, 
1995. 

[2]  A.  Isidori  and  C.  I.  Byrnes,  “Output  Regulation  of  Nonlinear  Systems,”  IEEE 
Transaction  on  Automatic  Control,  AC-25,  ppl3 1-140, 1990. 

[3]  A.  Serrani  and  A.  Isidori,  “Global  Output  Regulation  for  a  Class  of  Nonlinear 
Systems,”  38"’  IEEE  CDC  Conference,  1999. 

[4]  A.  Serrani,  “Robust  and  Adaptive  Output  Regulation  of  Nonlinear  Systems,”  Technical 
Report,  Dept,  of  Systems  Science  and  Mathematics,  Washington  University. 

[5]  A.  Serrani  and  A.  Isidori,  “Robust  Output  Regulation  for  a  Class  of  Non-minimum 
Phase  Systems,”  37"’  IEEE  CDC  Conference,  1998. 

[6]  S.  Bittanti,  F.  Lorito,  S.  Strada,  “An  LQ  approach  to  active  control  of  vibrations  in 
helicopters,”  Trans.  ASME,  J.  Dynamical  Systems,  Measurement  and  Control,  vol.  118, 
pp.  482-488,1996. 

[7]  C.I.  Byrnes,  A.  Isidori,  “Bifurcation  analysis  of  the  zero  dynamics  and  the  practical 
stabilization  of  nonlinear  minimum-phase  systems,”  Asian  J.  of  Control,  to  appear 
(2002). 

[8]  B.A.  Francis,  W.M.  Wonham,  “The  internal  model  principle  of  control  theory,” 
Automatica,  vol.  12,  pp.  457-465, 1977. 

[9]  R.  Shoureshi,  L.  Brackney,  N.  Kubota,  G.  Batta,  “A  modem  control  approach  to  active 
noise  control,”  Trans.  ASME  J.  Dynamical  Systems,  Measurement  and  Control,  vol. 
115,  pp.  673-678,  1993. 

[10]  J.  E.  Tiemo,  R.  M.  Murray,  and  J.  C.  Doyle,  “Numerically  Efficient  Robustness 
Analysis  of  Trajectory  Tracking  for  Nonlinear  Systems,”  AIAA  Journal  of  Guidance, 
Control,  and  Dynamics,  Nov.,  1995. 

[11]  C.I.  Byrnes,  D.  S.  Gilliam,  A.  Isidori,  Y.  Ikeda,  L  Marconi,  “Internal  Model  Based 
Design  for  the  Suppression  of  Harmonic  Disturbances,”  Directions  in  Mathematical 
Systems  Theory  and  Optimization,  Lecture  Notes  in  Control  and  Information  Sciences 
286,  Springer-Verlag,  Berlin  Hedelberg,  2003. 

[12]  A.  Melikyan,  N.  Hovakimyan,  and  Y.  Ikeda,  “Dynamic  Programming  Approach  to  a 
Minimum  Distance  Optimal  Control  Problem,”  42""  IEEE  Conference  on  Decision  and 
Control,  December,  9-12, 2003. 

[13]  K.  S.  Narendra,  A.  M.  Annaswamy,  Stable  Adaptive  Control,  Prentice  Hall,  1989. 

[14]  S.  S.  Sastry,  M.  Bodson,  Adaptive  Control:  Stability,  Convergence  and  Robustness, 
Prentice  Hall,  1989. 

[15]  J.  J.  Slotine,  W.  Li,  Applied  Nonlinear  Control,  Prentice  Hall,  New  Jersey,  1991. 

[16]  K.  A.  Wise,  J.  L.  Sedwick,  “Successive  Approximation  Solution  to  the  Hamilton- 
Jacobi-Isaacs  Equation,  Proc  of  the  33rd  IEEE  CDC,  Orlando  FL,  Dec.  1994, 
pp.  1387-1391. 


79 


GP49152 


Robust  Adaptive  Final  Report 

Control  of  UCAVs _ boeing  -  stl  2004poo37 

[17]  K.  A.  Wise,  D.  Broy,  “Agile  Missile  Dynamics  and  Control,”  Journal  of  Guidance, 
Control,  and  Dynamics,  Vol.  21,  No.  3,  1998,  pp.  441-449. 

[18]  K.  A.  Wise,  “Fighter  Aircraft  Control  Challenges  and  Technology  Transition,  ”  Systems 
and  Control  in  the  Twenty-First  Century,  Birkhauser,  1996. 

[19]  K.  A.  Wise,  J.  L.  Sedwick,  “Nonlinear  Control  Of  Agile  Missiles  Using  State 
Dependent  Riccati  Equations,  Proc.  Of  the  1997  ACC,  Albuquerque  NM,  June  1997, 
pp.  379-380. 

[20]  K.  A.  Wise,  D.  Broy,  “Agile  Missile  Dynamics  and  Control,”  AIAA  paper  No.  96- 
3912,  presented  at  the  AIAA  GNC  Conference,  San  Diego  CA,  August  1996. 

[21]  F.  Scarselli,  A.  C.  Tsoi,  “Universal  Approximation  using  Feedforward  Neural 
Networks:  A  Survey  of  some  existing  methods  and  Some  New  Results,”  Neural 
Networks,  vol.  11,  No.  1,  pp.  15-37. 

[22]  S.  Haykin,  "Neural  Networks:  A  Comprehensive  Foundation",  -  2nd  edition,  Prentice 
Hall  Inc.,  1999. 

[23]  J.  Hertz,  A.  Krogh,  R.  G.  Palmer,  "Introduction  to  the  Theory  of  Neural  Computation", 
Addison- Wesley  Publishing  Company,  1991. 

[24]  E.  Lavretsky,  N.  Hovakimyan,  A.  Calise,  “Adaptive  Extremum  Seeking  Control 
Design”,  American  Control  Conference,  May  2003. 

[25]  E.  Lavretsky,  “F/A-18  Autonomous  Formation  Flight  Control  System  Design”,  AIAA- 
2002-4757,  AIAA  Guidance,  Navigation  and  Control  Conference,  August  2002, 
Monterey,  CA. 

[26]  B.  Nguyen,  A.  Barfield,  Y.  Ikeda,  C.  Carlsson,  “Preliminary  Simulation  Predictions  of 
Nuisance  Criteria  for  an  Automatic  Air  Collision  Avoidance  System,”  The  9th  St. 
Petersburg  International  Conference  on  Integrated  Navigation  Systems,  May,  2001. 

[27]  K.  A.  Wise,  J.  Brinker,  “Miniaturized  Munition  Technology  Actuator  Analysis,” 
McDonnell  Douglas  Aerospace  Technical  Report. 

[28]  J.  E.  Tao,  P.  Kokotovic,  “Adaptive  Control  of  Systems,”  John  Wiley  &  Sons,  Inc., 

1996. 

[29]  J.  C.  Doyle,  J.  E.  Wall,  G.  Stein,  “Performance  and  Robustness  Analysis  for  Structured 
Uncertainty,”  2U'  IEEE  CDC  Conference,  1982. 

[30]  R.  R.  Degaston  and  Safonov,  “Exact  Calculation  of  the  Multiloop  Stability  Margin,” 
IEEE  Transaction  on  Automatic  Control,  Vol.  33,  No.2,  pp  156  -  171,  Feb,  1988. 

[31]  K.A.  Wise,  “Robustness  Analysis  Software  for  Real  Parameter  Variations: 

ROBUSTR,”  User  Manual  and  Documentation  &  Software  Listings,  McDonnell 
Douglas  Aerospace-East,  April,  1993. 

[32]  Byrnes,  C.I.,  Delli  Priscoli,  F.,  Isidori,  A.,  “Output  Regulation  of  Uncertain  Nonlinear 
Systems”,  Birkhauser,  1997. 

[33]  Francis,  B.A.  Wonham,  W.M.,  “The  Internal  Model  Principle  of  Control  Theory”, 
Automatica,  12:  457-465,  1976. 


80 


GP49152 


Robust  Adaptive  Final  Report 

Control  of  UCAVs _ _ _ _  boeing  -  stl  2004poo37 


APPENDIX  A 

PROJECTION  OPERATOR 

Definition  A.l:  Subset  H  c;  i?"  is  convex  if 

[Vx,>;e  ne /?”]=> [Ax+(l-i)j;  =  zGn],  V0<i<l 


(A.l) 


Relation  (A.l)  states  that  if  two  points  belong  to  a  convex  subset  IT  then  all  the  points  on  the 
connecting  line  must  also  belong  to  the  same  subset  El . 

Definition  A.2:  Function  ’  is  convex  if 

f(Xx+(\-X)y)<Xf(x)*{\-X)f(y),  VOSiSl 

Relation  A.2  is  illustrated  in  Figure  A-1,  and  states  that  graph  of  a  convex  function  must  be 
located  below  the  straight  line  that  connects  any  two  corresponding  function  values. 


Figure  A>1.  Convex  Function 


Statement  A.l:  Let  ^  '  be  convex  function.  Then  for  any  constant  ^  >  b  the  subset 

T\,={0sR’\  f(0)<S]  . 

I  '  '  j  IS  convex. 

Proof.  Let  ^1,  dj^Ug.  Then  and  7(^2) Since  f{x)  is  convex  then  for  any 

0<A<1 

/  \ 


/ 


+  (1  — 


<A/(0,)+(l-A)/(0,)<JiS+(l-A)S  =  S 
'  ' 


Therefore  f[6)<5  and,  consequently,  IT^  which  completes  the  proof 
f  ix\  •  >  i? 

statement  A.2:  Let  ^  ’  be  a  continuously  differentiable  convex  function.  Choose  a 

ng=\0eR"\f{e)<6]czR'’  0*eUx  , 

constant  o>v  g^d  consider  the  subset  ^  .  Let  ^  and  assume 
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not  on  the  boundary  of  Also,  let  and  assume  that 


/(^)  ^  (i.e.,  is  on  the  boundary  of  Then  the  following  inequality  takes  place; 

(e*-ef  vf{o)<o 


(A.3) 


where  V/(^)  = 


[^_m 


a/(^) 


V 


eR" 


is  the  gradient  vector  of  /  evaluated  at  0 . 


Relation  (A.3)  is  illustrated  on  Figure  A-2.  It  shows  that  the  gradient  vector  evaluated  at  any 
boundary  point  of  a  convex  set  always  points  away  from  the  set. 


Figure  A-2.  Gradient  and  Convex  Set 


Proof.  Since  f[x)  is  convex  then 

f(xe-  +{i-A)e)<A/(e‘)+{\-A)f{e) 

or  equivalently: 

f(e+^e--$))<f{e)+x(/(e-)-f(0)) 

Then  for  any  nonzero  0  <  A  <  1 : 

f(0+A(0*-0))-f{0)  ,  ^ 

^  . . - —<f{0*)-f{6)  <S-S^0 

A 

Taking  the  limit  as  A  -4  0  yields  relation  (A.3)  and  completes  the  proof 
Suppose  that  6,  the  “true”  parameter  vector,  belongs  to  a  convex  set  !!(, 


Introduce  another  convex  set: 


n,={(9sr|/(«)<o} 

(A.4) 

n,={«E/i"|/(«)<i} 

(A.5) 
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It  is  obvious  that  !!(,  e  H, .  We  may  now  define  the  projection  operator: 


Proj(e,y)  = 


7  ,if/(«)S0 

7  ,if[/(e)>0A7'‘V/(e)<0] 

Vf(e)(Vf(e))^ 


-yf{e)  ,  if  not. 


or,  equivalently: 

Proj(^,y): 


7/W  ,if[/(9)>0A7"V/(«)]>0 

||v/(e)| 

,  if  not 


(A.6) 


(A.7) 


Namely,  Proj(^,>’)  does  not  alter  the  vector  y  if  6  belongs  to  the  convex  set  Qq  defined  in 
(A.4).  In  the  set  {O  <  /  (^)  <  l} ,  the  projection  operator  subtracts  a  vector  normal  to  the 

boundary  |/(^)  =  A]  fromy  so  that  we  get  a  smooth  transition  from  the  original  vector  field  y 

for  /?.  =  0  to  a  tangent  to  the  boundary  vector  field  for  A  =  1 .  The  projection  operator  concept  is 
illustrated  on  Figure  A-3. 


Using  Statement  (A. 2)  and  inequality  (A.3),  we  get  the  following  important  property  of  the 
projection  operator: 
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0,  iff{e)<0 

0,  if/(^)>0and/V/(^)<0 


[e* -o')  (y-?xo]{9,y))  =  \ 


\ _ _ _ 2  ' - V - ^ 


<0 


iiv/(^)r 


^5 - if  not 

=/l>0 


(A.8) 


or,  equivalently 

(e-e*)"  (?xoy[e,y)-y)<0  (A.9) 

Based  on  (A.6),  we  can  now  define  the  projection  operator  when  both  Y  and  0  are  matrices 
of  the  same  dimensions,  that  is:  f  =  (>>,  ...  and  0  =  (^j  ...  df^)e 

Proj(0,r)  =  (Proj(^,,>;,)  ...  Proj(^;y,;;^))  (A.IO) 


Relation  (A.IO)  states  that  for  matrices  the  projection  operator  is  defined  column- wise. 

Finally,  we  show  that  the  use  of  the  projection  operator  (4.29)  makes  the  trace  term  in  the 
Lyapunov  function  derivative  (4.28)  semi-negative. 

r  r  A  ^ 


trace 


r’Q 

'  '  Proj(n,r)y 


m 

:  £ (proj (Q, ,  r, )  -  Yj )  ;i,  <  0  (A.  1 1) 


7=1 


<0 


SO 


In  (A.  11),  Qj  and  Yj  denote y-th  columns  of  and  Y,  correspondingly,  while  Xj 

represents  the y-th  non-negative  element  on  the  diagonal  of  A .  From  (A.l  1),  we  get  the  same 
adaptation  law  as  specified  in  (4.29). 


Next,  we  show  how  to  define  the  convex  function  /(•)  that  in  turn  defines  the  convex  set 
.  Both  definitions  are  based  on  the  known  upper  bound  ||Q^  ||  <  that  is  imposed  column¬ 
wise.  For  each  column  of  Q ,  choose  a  positive  constant  Cj  that  represents  a  desired  projection 
tolerance,  and  define  convex  function: 


/  /(r.) 

J  \  J  /  p,  p 


(A.12) 


and  convex  set 


(A.13) 
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Then  for  each  j  =  \,  m  : 

Adaptive  laws  (4.29)  require  computation  of  the  gradient  of  the  convex  function  (A.  12). 

V/(Q,)=|-Q,  (A.15) 


<o}={Q^.|||n^.||<Q;=”'} 


Finally,  it  should  be  noted  that  the  adaptation  with  projection  ensures  the  uniform 
boundedness  of  each  column  of  the  adaptive  matrix  Q  forward  in  time: 

||Q.(/)||'<(a7’‘)'+f.,Vl<  j<myt>0  (A.  16) 
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APPENDIX  B 

ADAPTATION  USING  SIGMOIDAL  NEURAL  NETWORKS 

The  universal  approximation  property  of  single-hidden-layer  (SHL)  neural  network  (NN) 
architeetures  states  that  a  eontinuous  scalar  function  /  (t:)  :R”  ^  R  can  be  approximated  by  a 

SHL  NN  with  sufficiently  many  neurons  on  a  compact  (closed  and  bounded)  domain  DczR"  to 
within  any  given  tolerance,  [46],  SHL  NN  estimate  of  a  scalar  function  /  has  the  following 

form: 

/(x)  =  J]w,.(t(v/'x-I-6;)  +  Co  (B.l) 

/=1 

where  N  is  the  number  of  neurons  in  the  hidden  layer,  wg  R'^  is  the  vector  of  output  weights, 

V,.  G  R”  is  the  vector  of  inner- layer  weights,  and  n  is  the  dimension  of  xe  /?” .  Moreover,  the 

vector  of  thresholds  is  denoted  by  be  R’^ ,  represents  the  NN  bias  term.  In  (B.l),  the  scalar 
function  <t(z)  ,  called  the  sigmoid,  models  the  output  activity  of  each  neuron.  A  commonly  used 

sigmoid  function  is  cr ( z )  =  ( 1 + )  ' .  As  seen  from  (B.l),  the  SHL  NN  approximation  is  linear 
in  the  output  weights  w.  and  the  bias  term  ,  but  it  is  nonlinear  in  the  inner-layer  weights  v. 
and  the  thresholds  6,.  In  formal  terms,  the  SHL  NN  universal  approximation  property  can  be 
stated  as  follows: 


Vf  >0  3A,  c,eR,  3w,  be  R\  3vg  R"  :|/(x)- /(x) 

<  f ,  V  X  e  Z) 

(B.2) 

Relation  (B.2)  can  also  be  written  in  a  matrix  form: 

f{x)  =  w^a(V^x  +  b)  +  Co 

(B.3) 

where  the  vector  of  sigmoids  <T  and  the  «-by-A  matrix  V  are  defined  below: 

civ^x+b'j  =  \<r(^vl  x+bi)  ...  <T^vljX+b^)^ 

(B.4) 

F  =  [v,  ... 

(B.5) 

In  order  to  not  discriminate  between  the  output  weights  w  and  the  bias  term  ,  the  iimer- 
layer  weights  v,  and  the  thresholds  b,  both  the  n-dimensional  independent  variable  x  and  the  N- 
dimensional  vector  of  sigmoids  <T  are  augmented  with  1-s.  Then  the  SHL  NN  approximation 
(B.3)  can  be  written  as: 

f(^x)  =  w^d'(V^  xj  (B.6) 

where 

x=[x^  ije/?""'  (B.7) 
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V  =  bj  e  (B.8) 

vv  =  [w"  Co7  (B.9) 

a{y^  x^  =  ^<j[v^  ...  ij  e  (B.IO) 

Finally,  generalizing  the  single-output  SHL  NN  (B.6)  to  handle  multi-output  function 
approximations  yields: 

F[x)  =  W^  a(V^x)  (B.ll) 

where  the  output  weights  matrix  W  has  the  number  of  rows  equal  to  the  dimension  of  the  vector- 
function  F{x)  that  is  being  approximated  on  the  compact  domain  D  czR" .  Note  that  while  x  in 
the  left  hand  side  of  (B.l  1)  represents  the  original  independent  variable,  the  right  hand  side  of 
(B.l  1)  depends  on  the  augmented  variable  jc  defined  by  (B.7).  In  what  follows  and  without  a 
loss  of  generality,  the  SHL  NN  approximation  (B.ll)  will  be  written  in  the  form: 

F{x)  =  W^  d[V^  f)  (B.12) 


In  (B.12),  F  =  ^  a  represents  the  (A^  +  l)  -  dimensional  vector  of 

sigmoids  augmented  with  a  unity. 

d-(F^//)  =  (f7(vf x+6,)  ...  l)  (B.13) 

To  this  end,  consider  the  system  error  dynamics  and  approximate  the  matched  uncertainty 
Kq  (xp)  using  SHL  NN  with  sigmoids  in  its  inner  layer.  The  SHL  NN  Universal 

Approximation  Theorem  [46]  states  that  given  an  approximation  tolerance  >  0 ,  and  a  compact 
set  X  cz  R" ,  there  must  exist  an  integer  and  “true”  constant  weight  matrices  W  and  V  such 
that  for  all  x^e  X  (z  R" : 


and 


Let 


\  1  J 

"v- 

A 


-\-€t 


■o(^p) 


k,(x^)  =  W^  a[v^  f) 


(B.14) 


(B.l  5) 

(B.l  6) 


represent  the  neural  approximation  of  the  uncertainty  .  Then  the  approximation  error 

becomes: 


B-2 


GP49152 


Robust  Adaptive 
Control  of  UCAVs 


Final  Report 

BOEING  -  STL  2004P0037 


=  <t{v^  (B.17) 

The  SHL  NN  weigh  approximation  errors  are  defined  as: 

AW  =  W-W,  AV  =  V-V  (B.18) 

Using  Taylor  series  expansion  of  the  vector-function  around  the  vector  // ,  we 


get: 


fi)-a(V^  /ii)  =  &\v^  fi  +  OnAV^  fi 


(B.19) 


where  =  {r/<7(z)/c/z]  and  (9(z^)  denotes  Taylor  series  expansion  terms  of  order 

2.  In  that  case: 

a[v^  p)-W^  d[v'^  m)  =  W''  a[v'^  li)-W^  a{y^  p)±W'^  p) 


=  AW^  d[v^  ju)  +  fV^  u)] 

=  AW^  d[v^  p)  +  W'^[a'[v'^^)AV^  l^-\-oil^AV^jil^^ 


(B.20) 


Ar 


=  AW^  //) 4- 

Further  manipulation  of  the  terms  in  (B.20)  leads  to: 

(f  V)  -  IT’' <f  ( r  V)  = 

=  AfF^d(r^/i)  +  (fF-Awf  ^'(fV)af’'//  +  it’'o(||afV||^J 
=  AfF^  cf(F’' //)  +  #’■  d-'(F’’//)AF^ //- AIF’’ a  (fV)af’’ //  + o| 

Substituting  for  AF  =  F  -  F  into  the  3*^^  term  in  (B.21)  yields: 

d(^F^  /i)  =  Afr^  +  a'^F^ju'jAF^  jU 

-A  IF’'  a'  ( F  V )  ( F  -  F )  V  +  ^  (II A  ^  Vf ) 

=  A  IF’' ( a  ( F" // )  -  a^' ( F^// )  F^ // )  +  #^  <J' ( F^// )  A  F^ // 

-t- AIF’'  (T'(f’'//)  F^p  +  W^  o(||AF’'//f  j 


(B.21) 


(B.22) 


Using  (B.17)  the  uncertainty  approximation  error  AKf,  can  be  written  as  a  function  that 
linearly  depends  on  the  SHL  NN  weights  estimation  errors  A  IF  and  AF : 
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Substituting  (B.23)  into  the  error  dynamics  equation 


e  =  A,.^J^e  +  B^  A 


{K,-K:f  x-l-{K,-K:f  u-AK,(x^) 


AK^ 


AK„ 


=  e+B,A(AKlx+AK:u-AK,(x^)) 


(B.23) 


yields: 

e  =  e+ 5,  A  ( x+AK^u-AK,  (x^ )) 

=  A,.^f  e  +  B,A{AKlx+AKlu)-B,A(Q-e,(x^))  (B.24) 

-B^Ai^AW^  ju)-a'[v^juy^ +  m] 

To  this  end,  consider  the  following  Lyapunov  function  candidate: 

V(e,  AK^,  AK^,AW,  AV)  =  e^Pe 

+trace(AAj  rj  AK^  A)+ trace(AA:J  r'J  AK^  A)  (B.25) 

+trace(AlT^  T''  APT A)  +  trace(AF^  Ty  AV) 

In  (B.25),  P  is  the  unique  symmetric  positive  definite  matrix  solution  of  the  algebraic 
Lyapunov  equation: 

a:,p+pa,,,=-q 


In  (B.25),  r^,  r„,  Fjf,,  are  symmetric  positive  definite  matrices  of  the  corresponding 
dimensions.  The  time  derivative  of  V  along  the  trajectories  of  the  error  dynamics  is: 

V{e,  AK^,  AK^,  AW,  AV)  =  -e^Qe 

+2/P5,  A(a^J  x  + A^;  u) 

-2e^PB,AAW^i^d(v^  ju)-d'[v^u)v^  m) 

-le^PB^AW^  a'(v^ju)AV^  p  (B.26) 

+2trace|A^J  rj  A)  +  2trace|A/^J  T''  aJ 

+2trace|APL''  L''  IF  Aj  +  2trace|AF^  Ty  fJ 

-2e^PB,A(Q-e,(x^)) 
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Using  the  trace  identity  y  =  trace )  we  further  get; 


V{e,  AA:,,  AK„  aw,  AV)  =  -e^Qe 
+2trace|AA^J |r''  P 

+2trace|AA^J  |r^'  +  ue^ P 

+2trace|AfF^ 

+2trace|Ar^(r“‘ 

-2e^PB,A[a-£,[x^)) 

The  following  adaptive  laws  are  chosen: 

'k,=T,?ro][k,,-x/PB) 
k„=T,?xo](k,,-u/ PB,) 

#  =  Proj((a(r" //) F" //)/P5, ) 
P  =  Ty  Proj(///P5,  ^'(^^P)) 


(B.27) 


(B.28) 


Substitution  of  (B.28)  into  (B.27)  yields: 

V{e,AK^,AK^,AW,AV) 

=  -e^Qe  +  2trace(^AV^  jue^PB^{l„,-A)W^  &'{v^p))-2e^PB^A[Q-£o{xp)) 
=  -e^Qe+2e^PB,[{l„-A)W^a'(v^p)AV^jU-A{Q-£,{x^))) 

s-^i.(e)Hf+2|klll|/>s,ll(l4-A|ll»'|l||'»'(>^V)A>'V||+||A|(|n||+C'^ 


Since  the  sigmoid  d  and  its  derivative  a  are  bounded  functions  of  their  arguments,  an 
upper  bound  for  the  second  order  terms  in  the  Taylor  series  expansion  (B.19)  can  be  computed. 


<2a^+\s’[r  f\ 
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Using  (B.30)  an  upper  bound  for  Q  can  be  obtained. 

||Q||  =  AW^  a[v^ju)  V^p  +  W^  o(||ArV" 


< 


||A(fl|#'(FV)FV|+||(f||(2,T„,+|<T'(FV)Ar>|)<n. 


(B.31) 


The  uniform  constant  upper  bound  in  (B.3 1)  is  due  to  the  local  properties  of  the 
sigmoidal  function  a  and  the  projection  operator  used  in  adaptive  laws  (B.28). 

Similar  arguments  applied  to  (B.29)  yield: 

V{e,AK^.,AK,.,AW,AV) 

s-4.(e)ltf +2|HII|/’s,1(I|4  -  A|k, +||A||(n„„ +c')) 

where  positive  constant  c,  is  chosen  such  that  AF^//  <c, .  From  (B.32)  it 

immediately  follows  that  the  time  derivative  of  the  Lyapunov  function  is  negative  definite 
outside  of  the  following  compact  set: 


(B.32) 


cG  R" :  e\< 


2|l^«.|l(l|4-A||c,+||A||(£2,„+£r)) 


(B.33) 


This,  in  turn,  implies  bounded  tracking  of  the  reference  signal.  In  addition,  since  the 
projection  operator  bounds  the  SHL  NN  estimated  weighs  W,  V  and  their  increments  AW,  AV , 
all  the  signals  in  the  corresponding  closed-loop  system  remain  bounded. 
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APPENDIX  C 

APPROXIMATE  SOLUTION  OF  THE 
ALGEBRAIC  LYAPUNOV  EQUATION 

The  Lyapunov  Algebraic  Equation  (LAE)  is  of  the  form: 

0  =  PA  +  A^P+Q  (C.l) 

where  A  is  the  (n-hy-n)  -  Hurwitz  matrix  (all  eigenvalues  of  A  are  in  the  left  half  plane),  Q  is  the 
(«-by-«)  -  symmetric  positive  definite  matrix,  and  P  is  the  (n-by-«)  -  symmetric  positive-definte 
solution  matrix.  The  exact  solution  of  (C.l)  is: 

OO 

P=  |exp[^^r)0exp(^r)ffr  (C.2) 

0 

Let 

t 

P(r)  =  Jexp(2i^r)2exp(^r)c/r  (C.3) 

0 

Then 


t 

PA+  A^P  =  Jjexp ( A^t'j Q exp [At)A  +  A^  exp ( Q exp [Ar^dt 


0 

d  jexp  (^A^rjQ  exp  ( A 
dr 


dT  =  exp(^A^tjQexp(At)-Q  =  P-Q 

\ - - / 

p 


The  resulting  equation 

P  =  PA  +  A^P+Q  (C.4) 

is  called  the  Lyapunov  Differential  (continuous)  Equation  (LDE).  From  (C.2)  and  (C.3)  it 
follows  that 


lim||A(0l  =  />  (C.5) 

In  other  words,  the  LDE  solution  P{t)  approximates,  {i.e.,  asymptotically  tends  to),  the 
corresponding  LAE  solution  P ,  forward  in  time. 

Conclusion:  Numerical  integration  of  LDE  (C.4)  yields  an  approximation  to  the  true  LAE 
solution. 

Example:  Using  AB-2  integration  method,  we  get 


'Hh)  =  P[t,)A-yA^P{t,)^Q 

^(4.,)  =  ^(^J  +  A^(1-5^(4)-0.5>(4_,)),  ^  =  1,  2,  ... 


(C.6) 
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